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1  FORWARD  BY  PROF.  MICHAEL  J.  S.  DEWAR 


"MOPAC  is  the  present  culmination  of  a 
continuing  project  that  started  twenty  years  ago, 
directed  to  the  development  of  quantum  mechanical 
procedures  simple  enough,  and  accurate  enough  ,  to 
be  useful  to  chemists  as  an  a id  in  their  own 
research.  A  historical  account  of  this 
development,  with  references,  has  appeared  [1]. 
The  first  really  effective  treatment  was  MINDO/3 
[2],  which  is  still  useful  in  various  areas  of 
hydrocarbon  chemistry  but  ran  into  problems  with 
heteroatoms.  This  was  succeeded  by  MNDO  [3]  and 
more  recently  by  AMI  [ 4 ]  which  seems  to  have 
overcome  most  of  the  deficiencies  of  its 
predecessors  at  no  cost  in  computing  time. 

Our  computer  programs  steadily  evolved  with 
the  development  of  new  algorithms.  In  addition  to 
the  basic  programs  for  the  SCF  calculations  and 
geometry  optimization,  programs  were  developed  for 
calculating  vibration  frequencies  [5], 
thermodynamic  parameters  [6],  kinetic  isotope 
effects  17],  linear  polymers  [8],  polarizabilities 
and  hyperpolarizabilities  [9,10],  and  SCF-CI 
calculations  [11].  While  this  disjointed 
collection  of  programs  served  its  purpose,  it  was 
inconvenient  and  time  consuming  to  use.  A  major 
step  was  the  integration  [12]  of  most  of  these 
into  a  single  unified  program  [MOPAC]  with  a 
greatly  simplified  input.  The  individual  programs 
were  also  rewritten  in  a  more  efficient  form  so 
that  the  computing  time  reported  for  most 
calculations  has  now  been  halved.  In  its  present 
form  MOPAC  is  impressively  easy  to  use  and  it 
contains  options  for  nearly  all  the  applications 
where  our  procedures  have  been  found  useful." 


Michael  J.S.  Dewar,  January  1987 
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2  PREFACE  TO  SIXTH  EDITION,  VERSION  6.00 

As  indicated  at  the  time  of  release  of  MOPAC  5.00,  there  has 
been  a  gap  of  two  years'  duration.  It  is  likely  that  a  second  gap 
of  two  years'  duration  will  follow  this  release. 

The  main  change  from  a  user's  point  of  view  (who  else  matters?) 
in  MOPAC  6.00  has  been  that  MOPAC  now  runs  faster  than  before.  In 
addition,  the  range  of  PM3  and  AMI  elements  is  increased.  Finally, 
some  bells  and  whistles  have  been  added,  such  as  Gaussian  Z-matrix 
input  and  output,  polymer  electronic  and  phonon  band  structures  and 
densities  of  states.  Experienced  users  should  refer  to  the  'update 
release  notes'  for  a  concise  description  of  all  modifications. 


3  UPDATES  FROM  VERSION  5.00 


Except  for  MOPAC  5.00,  MOPAC  has  been  updated  once  a  year. 
This  is  the  best  compromise  between  staying  current  and  asking  users 
to  continuously  change  their  software.  Updates  may  be  obtained  from 
QCPE  at  the  same  cost  as  the  original,  or  from  sites  that  have  a 
current  copy.  All  VAX  versions  of  MOPAC  have  the  same  QCPE  number  - 
455;  they  are  distinguished  by  version  numbers.  Users  are 
recommended  to  update  their  programs  at  least  once  every  two  years, 
and  preferably  every  year. 

New  Features  of  Version  6.0 

1.  PM3  has  been  extended  to  include  Be,  Mg,  Zn,  Ga,  Ge,  As,  Se,  Cd, 
In,  Sn,  Sb,  Te,  Hg,  Tl,  Pb,  and  Bi . 

2.  Changes  to  the  IRC/DRC 

1.  The  amount  of  output  has  been  reduced.  The  'missing'  output 
can  be  printed  by  using  appropriate  keywords. 

2.  Half-lives  are  now  accurately  generated.  Earlier  versions 
had  a  small  error  due  to  calculation  start-up.  Both 
positive  and  negative  half-lives  are  now  accurate. 


3.  The  energy  partition  output  has  been  rewritten  so  that  all  terms 
having  to  do  with  each  diatomic  pair  are  now  printed  on  one 
line. 

4.  A  LOG  file  will  normally  be  generated.  This  is  intended  to  be 
read  while  the  calculation  is  running.  The  LOG  file  can  be 
suppressed  by  the  user. 

5.  Elements  can  be  labeled  with  up  to  six  alphanumeric  characters. 
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6.  Gaussian  Z-matrices  can  be  input  and  printed  (in  the  ARC  file). 

7.  Multiple  data-sets  can  be  run  in  one  job. 

8.  Up  to  three  lines  of  keywords  can  be  specified. 

9.  The  DEBUG,  1SCP,  and  C.I.  keywords  have  been  re-defined. 

10.  An  Eigenvector  Following  option  has  been  added. 

11.  Polymer  electronic  band  structure  and  density  of  states  added. 

12.  Polymer  phonon  band  structure  and  density  of  states  added. 

13.  The  GRID  option  has  been  rewritten. 

14.  The  PATH  option  has  been  extended. 


KEYWORDS  ADDED  IN  VERSION  6.00 

In  going  to  Version  6.00,  many  keywords  were  added.  These  are 


defined 

keywords 

fully  later 
follows: 

on.  The 

complete 

set  of  new  or  modified 

& 

+ 

AIDER 

AIGIN 

AIGOUT 

DIPOLE 

DIPX 

DIPZ 

DIPZ 

DMAX=n . n 

EF 

EIGINV 

ESP 

ESPRST 

HESS=n 

IUPD=n 

K=(n.nn,n) 

MODE=n 

MS=n 

NOANCI 

NODUS 

NOLOG 

NONR 

NOTH I EL 

NSURF 

OLDGEO 

ORIDE 

POINT 

POINTl=n 

POINT2=n 

POTWRT 

RECALC=n 

SCALE 

SCFCRT= 

SCINCR 

SETUP 

ST03G 

SETUP=name 

TS 

SLOPE 

WILLIAMS 

STEP 

SYMAVG 

KEYWORDS  DROPPED  FROM  VERSION  5.00 

FULSCF  Reason:  Line  searches  now  always  involve  full  SCF 

calculations.  The  frozen  density  matrix  option  is  no  longer 
supported. 

CYCLES=n  Reason:  The  maximum  number  of  cycles  is  now  not  defined. 
Users  should  control  jobs  via  "  t=n.nn" . 


ERRORS  CORRECTED  IN  VERSION  5.0 

1.  Force  constants  and  frequencies  calculated  using  non-variationally 
optimized  wavef unctions  were  faulty. 

2.  A  full  keyword  line  (no  extra  spaces)  would  be  corrupted  if  the  first 
character  was  not  a  space. 
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3.  PRECISE  FORCE  calculations  on  triatomics  had  spuriously  large  trivial 
vibrations. 

4.  FORCE  calculations  with  many  more  hydrogen  atoms  than  MAXLIT  would 
fail  to  generate  force  constants  or  normal  coordinates. 

5.  The  EXTERNAL  option  was  limited  to  AMI. 

6.  Vibrational  transition  dipoles  were  in  error  by  about  30%. 

7.  The  reformation  of  the  density  matrix  when  a  non-variationally 
optimized  wavef unction  was  used  was  incomplete. 


Help  with  MOPAC 


Telephone  and  mail  support  is  given  by  the 
Frank  J.  Seiler  Research  Laboratory  on  a  time 
permitting  basis.  If  you  need  help,  call 
the  Seiler  MOPAC  Consultant  at 

(719)  472-2655 

Similarly,  mail  should  be  addressed  to 

MOPAC  Consultant 
FJSRL/NC 

U.S.  Air  Force  Academy  CO  80840-6528 
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CHAPTER  1 


DESCRIPTION  OF  MOPAC 


MOPAC  is  a  general-purpose  semi-empirical  molecular  orbital  package 
for  the  study  of  chemical  structures  and  reactions.  The  semi-empirical 
Hamiltonians  MNDO,  MINDO/3,  AMI,  and  PM3  are  used  in  the  electronic  part 
of  the  calculation  to  obtain  molecular  orbitals,  the  heat  of  formation 
and  its  derivative  with  respect  to  molecular  geometry.  Using  these 
results  MOPAC  calculates  the  vibrational  spectra,  thermodynamic 
quantities,  isotopic  substitution  effects  and  force  constants  for 
molecules,  radicals,  ions,  and  polymers.  For  studying  chemical 
reactions,  a  transition-state  location  routine  and  two  transition  state 
optimizing  routines  are  available.  For  users  to  get  the  most  out  of  the 
program,  they  must  understand  how  the  program  works,  how  to  enter  data, 
how  to  interpret  the  results,  and  what  to  do  when  things  go  wrong. 

While  MOPAC  calls  upon  many  concepts  in  quantum  theory  and 
thermodynamics  and  uses  some  fairly  advanced  mathematics,  the  user  need 
not  be  familiar  with  these  specialized  topics.  MOPAC  is  written  with  the 
non-theoretician  in  mind.  The  input  data  are  kept  as  simple  as  possible 
so  users  can  give  their  attention  to  the  chemistry  involved  and  not 
concern  themselves  with  quantum  and  thermodynamic  exotica. 

The  simplest  description  of  how  MOPAC  works  is  that  the  user  creates 
a  data-file  which  describes  a  molecular  system  and  specifies  what  kind  of 
calculations  and  output  are  desired.  The  user  then  commands  MOPAC  to 
carry  out  the  calculation  using  that  data-file.  Finally  the  user 
extracts  the  desired  output  on  the  system  from  the  output  files  created 
by  MOPAC. 


NOTES  (1)  This  Is  the  "sixth  edition”.  MOPAC  has  undergone  a  steady 
expansion  since  its  first  release,  and  users  of  the  earlier  editions  are 
recommended  to  familiarize  themselves  with  the  changes  which  are 
described  in  this  manual.  If  any  errors  are  found,  or  if  MOPAC  does  not 
perform  as  described,  please  contact  Dr.  James  J.  P.  Stewart, 
Frank  J.  Seiler  Research  Laboratory,  U.S.  Air  Force  Academy,  Colorado 
Springs,  CO  80840-6528.  (2)  MOPAC  runs  successfully  on  normal  CDC,  Data 
General,  Gould,  and  Digital  computers,  and  also  on  the  CDC  205  and 
CRAY-XMP  "supercomputers”.  The  CRAY  version  has  been  partly  optimized  to 
take  advantage  of  the  CRAY  architecture.  Several  versions  exist  for 
microcomputers  such  as  the  IBM  PC-AT  and  XT,  Zenith,  etc. 
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1.1  SUMMARY  OF  MOPAC  CAPABILITIES 

1.  MNDOr  MINDO/3 ,  AMI,  and  PM 3  Hamiltonians. 

2.  Restricted  Hartree-Fock  (RHF)  and  Unrestricted  Hartree-Fock 
(UHF)  methods. 

3.  Extensive  Configuration  Interaction 

1.  100  configurations 

2.  Singlets,  Doublets,  Triplets,  Quartets,  Quintets,  and 
Sextets 

3.  Excited  states 

4.  Geometry  optimizations,  etc.,  on  specified  states 

4.  Single  SCF  calculation 

5.  Geometry  optimization 

6.  Gradient  minimization 

7.  Transition  state  location 

8.  Reaction  path  coordinate  calculation 

9.  Force  constant  calculation 

10.  Normal  coordinate  analysis 

11.  Transition  dipole  calculation 

12.  Thermodynamic  properties  calculation 

13.  Localized  orbitals 

14.  Covalent  bond  orders 

15.  Bond  analysis  into  sigma  and  pi  contributions 

16.  One  dimensional  polymer  calculation 

17.  Dynamic  Reaction  Coordinate  calculation 

18.  Intrinsic  Reaction  Coordinate  calculation 
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1.2  COPYRIGHT  STATUS  OF  MOPAC 

At  the  request  of  the  Air  Force  Academy  Law  Department  the  following 
notice  has  been  placed  in  MOPAC. 

Notice  of  Public  Domain  nature  of  MOPAC 

'This  computer  program  is  a  work  of  the  United  States 
Government  and  as  such  is  not  subject  to  protection  by 
copyright  (17  U.S.C.  #  105.)  Any  person  who  fraudulently 
places  a  copyright  notice  or  does  any  other  act  contrary 
to  the  provisions  of  17  U.S.  Code  506(c)  shall  be  subject 
to  the  penalties  provided  therein.  This  notice  shall  not 
be  altered  or  removed  from  this  software  and  is  to  be  on 
all  reproductions.' 

I  recommend  that  a  user  obtain  a  copy  by  either  copying  it  from  an 
existing  site  or  ordering  an  'official'  copy  Erora  the  Quantum  Chemistry 
Program  Exchange,  (QCPE),  Department  of  Chemistry,  Indiana  University, 
Bloomington,  Indiana,  47405.  The  cost  covers  handling  only.  Contact  the 
Editor,  Richard  Counts,  at  (812)  855-4784  for  further  details. 


1.3  PORTING  MOPAC  TO  OTHER  MACHINES 

MOPAC  is  written  for  the  DIGITAL  VAX  computer.  However,  the  program 
has  been  written  with  the  idea  that  it  will  be  ported  to  other  machines. 
After  such  a  port  has  been  done,  the  new  program  should  be  given  the 
version  number  6.10,  or,  if  two  or  more  versions  are  generated,  6.20, 
6.30,  etc.  To  validate  the  new  copy,  QCPE  has  a  test-suite  of 
calculations.  If  all  tests  are  passed,  within  the  tolerances  given  in 
the  tests,  then  the  new  program  can  be  called  a  valid  version  of  MOPAC  6. 
Insofar  as  is  practical,  the  mode  of  submission  of  a  MOPAC  job  should  be 
preserved,  e.g., 

(prompt)  MOPAC  <data-set>  [<queue-options> . . . ) 

Any  changes  which  do  not  violate  the  FORTRAN-77  conventions,  and 
which  users  believe  would  be  generally  desirable,  can  be  sent  to  the 
author . 


1.4  RELATIONSHIP  OF  AMPAC  AND  MOPAC 


In  1985  MOPAC  3.0  and  AMPAC  1.0  were  submitted  to  QCPE  for 
distribution.  At  that  time,  AMPAC  differed  from  MOPAC  in  that  it  had  the 
AMI  algorithm.  Additionally,  changes  in  some  MNDO  parameters  in  AMPAC 
made  AMPAC  results  incompatable  with  MOPAC  Versions  1-3.  Subsequent 
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versions  o£  MOPAC ,  in  addition  to  being  more  highly  debugged  than  Version 
3.0,  also  had  the  AMI  method.  Such  versions  were  compatable  with  AMPAC 
and  with  versions  1-3  of  MOPAC. 

In  order  to  avoid  confusion,  all  versions  of  MOPAC  after  3.0  include 
journal  references  so  that  the  user  knows  unambiguously  which  parameter 
sets  were  used  in  any  given  job. 

Since  1985  AMPAC  and  MOPAC  have  evolved  along  different  lines.  In 
MOPAC  I  have  endeavoured  to  provide  a  highly  robust  program,  one  with 
only  a  few  new  features,  but  which  is  easily  portable  and  which  can  be 
relied  upon  to  give  precise,  if  not  very  exciting,  answers.  At  Austin, 
the  functionality  of  AMPAC  has  been  enhanced  by  the  research  work  of 
Prof.  Dewar's  group.  The  new  AMPAC  2.1  thus  has  functionalities  not 
present  in  MOPAC.  In  publications,  users  should  cite  not  only  the 
program  name  but  also  the  version  number. 

Commercial  concerns  have  optimized  both  MOPAC  and  AMPAC  for  use  on 
supercomputers.  The  quality  of  optimization  and  the  degree  to  which  the 
parent  algorithm  has  been  preserved  differs  between  MOPAC  and  AMPAC  and 
also  between  some  machine  specific  versions.  Different  users  may  prefer 
one  program  to  the  other,  based  on  considerations  such  as  speed.  Some 
modifications  of  AMPAC  run  faster  than  some  modifications  of  MOPAC,  and 
vice  versa,  but  if  these  are  modified  versions  of  MOPAC  3.0  or  AMPAC  1.0, 
they  represent  the  programming  prowess  of  the  companies  doing  the 
conversion,  and  not  any  intrinsic  difference  between  the  two  programs. 

Testing  of  these  large  algorithms  is  difficult,  and  several  times 
users  have  reported  bugs  in  MOPAC  or  AMPAC  which  were  introduced  after 
they  were  supplied  by  QCPE. 

Cooperative  Development  of  MOPAC 

MOPAC  has  developed,  and  hopefully  will  continue  to  develop,  by  the 
addition  of  contributed  code.  As  a  policy,  any  supplied  code  which  is 
incorporated  into  MOPAC  will  be  described  in  the  next  release  of  the 
Manual,  and  the  author  or  supplier  acknowledged.  In  the  following 
release  only  journal  references  will  be  retained.  The  objective  is  to 
produce  a  good  program.  This  is  obviously  not  a  one-person  undertaking; 
if  it  was,  then  the  product  would  be  poor  indeed.  Instead,  as  we  are  in 
a  time  of  rapid  change  in  computational  chemistry,  a  time  characterized 
by  a  very  free  exchange  of  ideas  and  code,  MOPAC  has  been  evolving  by 
accretion.  The  unstinting  and  generous  donation  of  intellectual  effort 
speaks  highly  of  the  donors.  However,  with  the  rapid  commercialization 
of  computational  chemistry  software  in  the  past  few  years,  it  is 
unfortunate  but  it  seems  unlikely  that  this  idyllic  state  will  continue. 
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1.5  PROGRAMS  RECOMMENDED  FOR  USE  WITH  MOPAC 


MOPAC  is  the  core  program  of  a  series  of  programs  for  the  theoretical 
study  of  chemical  phenomena.  This  version  is  the  sixth  in  an  on-going 
development,  and  efforts  are  being  made  to  continue  its  further 
evolution.  In  order  to  make  using  MOPAC  easier,  five  other  programs  have 
also  been  written.  Users  of  MOPAC  are  recommended  to  use  all  four 
programs.  Efforts  will  be  made  to  continue  the  development  of  these 
programs . 

HELP 

HELP  is  a  stand-alone  program  which  mimics  the  VAX  HELP  function. 
It  is  intended  for  users  on  UNIX  computers.  HELP  comes  with  the  basic 
MOPAC  6.00,  and  is  recommended  for  general  use. 

DRAW 

DRAW,  written  by  Maj.  Donn  Storch,  USAF,  and  available  through  QCPE, 
is  a  powerful  editing  program  specifically  written  to  interface  with 
MOPAC.  Among  the  various  facilities  it  offers  are: 

1.  The  on-line  editing  and  analysis  of  a  data  file,  starting  from 
scratch  or  from  an  existing  data  file,  an  archive  file,  or  from 
a  results  file. 

2.  The  option  of  continuous  graphical  representation  of  the  system 
being  studied.  Several  types  of  terminals  are  supported, 
including  DIGITAL,  TEKTRONIX,  and  TERAK  terminals. 

3.  The  drawing  of  electron  density  contour  maps  generated  by 
DENSITY  on  graphical  devices. 

4.  The  drawing  of  solid-state  band  structures  generated  by  MOSOL. 

5.  The  sketching  of  molecular  vibrations,  generated  by  a  normal 
coordinate  analysis. 

DENSITY 

DENSITY,  written  by  Dr.  James  J.  P.  Stewart,  and  available  through 
QCPE,  is  an  electron-density  plotting  program.  It  accepts  data-files 
directly  from  MOPAC,  and  is  intended  to  be  used  for  the  graphical 
representation  of  electron  density  distribution,  individual  M.O.'s,  and 
difference  maps. 

MOHELP 

MOHELP,  also  available  through  QCPE,  is  an  on-line  help  facility, 
written  by  Maj.  Donn  Storch  and  Dr.  James  J.  P.  Stewart,  to  allow  non-VAX 
users  access  to  the  VAX  HELP  libraries  for  MOPAC,  DRAW,  and  DENSITY. 
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MOSOL 

MOSOL  (Distributed  by  QCPE)  is  a  full  solid-state  HNDO  program 
written  by  Dr.  James  J.  P.  Stewart.  In  comparison  with  MOPACf  MOSOL  is 
extremely  slow.  As  a  result,  while  geometry  optimization,  force 
constants,  and  other  functions  can  be  carried  out  by  MOSOL,  these  slow 
calculations  are  best  done  using  the  solid-state  facility  within  MOPAC. 

|  MOSOL  should  be  used  for  two  or  three  dimensional  solids  only,  a  task 
that  MOPAC  cannot  perform. 


1.6  THE  DATA-FILE 

This  section  is  aimed  at  the  complete  novice  —  someone  who  knows 
nothing  at  all  about  the  structure  of  a  MOPAC  data-file. 

First  of  all,  there  are  at  most  four  possible  types  of  data-files 
for  MOPAC,  but  the  simplest  data-file  is  the  most  commonly  used.  Rather 
than  define  it,  two  examples  are  shown  below.  An  explanation  of  the 
geometry  definitions  shown  in  the  examples  is  given  in  the  chapter 
"GEOMETRY  SPECIFICATION" . 


1.6.1  Example  Of  Data  For  Ethylene 


Line 

1 

UHF  PULAY  MIND03 

VECTORS  DENSITY 

LOCAL  T=300 

Line 

2 

EXAMPLE  OF  DATA 

FOR  MOPAC 

Line 

3 

MINDO/3  UHF  CLOSED-SHELL 

D2D 

ETHYLENE 

Line 

4a 

C 

Line 

4b 

C 

1.400118 

1 

Line 

4c 

H 

1.098326 

1 

123.572063 

1 

Line 

4d 

H 

1.098326 

1 

123.572063 

1 

180.000000 

0 

2 

1 

3 

Line 

4e 

H 

1.098326 

1 

123.572063 

1 

90.000000 

0 

1 

2 

3 

Line 

4f 

H 

1.098326 

1 

123.572063 

1 

270.000000 

0 

1 

2 

3 

Line 

5 

As  can  be  seen,  the  first  three  lines  are  textual.  The  first  line 
consists  of  keywords  (here  seven  keywords  are  shown).  These  control  the 
calculation.  The  next  two  lines  are  comments  or  titles.  The  user  might 
want  to  put  the  name  of  the  molecule  and  why  it  is  being  run  on  these  two 
lines. 


These  three  lines  are  obligatory,  if  no  name  or  comment  is  wanted, 
leave  blank  lines.  If  no  keywords  are  specified,  leave  a  blank  line.  A 
common  .rror  is  to  have  a  blank  line  before  the  keyword  line:  this  error 
is  quite  tricky  to  find,  so  be  careful  not  to  have  four  lines  before  the 
start  of  the  geometric' data  (lines  4a-4f  in  the  example).  Whatever  is 
decided,  the  three  lines,  blank  or  otherwise,  are  obligatory. 
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In  the  example  given,  one  line  of  keywords  and  two  of  documentation 
are  shown.  By  use  of  keywords,  these  defaults  can  be  changed.  Modifying 
keywords  are  +,  &,  and  SETUP.  These  are  defined  in  the  KEYWORDS  chapter. 

The  following  table  illustrates  the  allowed  combinations: 


Line 

1 

Line 

2 

Line  3 

Line  4 

Line  5 

Setup  used 

Keys 

Text 

Text 

Z-matr ix 

Z-matrix 

not  used 

Keys 

+ 

Keys 

Text 

Text 

Z-matrix 

not  used 

Keys 

+ 

Keys 

+ 

Keys 

Text 

Text 

not  used 

Keys 

& 

Keys 

Text 

Z-matrix 

Z-matrix 

not  used 

Keys 

& 

Keys 

& 

Keys 

Z-matrix 

Z-matrix 

not  used 

Keys 

SETUP 

Text 

Text 

Z-matrix 

Z-matrix 

1  or  2  lines 

Keys 

+ 

Keys 

SETUP 

Text 

Text 

Z-matrix 

1  line  used 

Keys 

& 

Keys 

SETUP 

Text 

Z-matrix 

Z-matrix 

1  line  used 

used 


No  other  combinations  are  allowed. 


The  proposed  use  of  the  SETUP  option  is  to  allow  a  frequently 
used  set  of  keywords  to  be  defined  by  a  single  keyword.  For 
example,  if  the  default  criteria  are  not  suitable,  SETUP  might 
contain 

"  SCFCRT=l.D-8  SHIFT=30  ITRY=600  GNORM=0.02  ANALYT  " 

II  II 

The  order  of  usage  of  a  keyword  is  Line  1  >  Line  2  >  Line  3.  Line  1 
>  SETUP.  Line  2  >  SETUP.  SETUP  >  built  in  default  values. 


The  next  set  of  lines  defines  the  geometry.  In  the  example, 
the  numbers  are  all  neatly  lined  up;  this  is  not  necessary,  but  does 
make  it  easier  when  looking  for  errors  in  the  data.  The  geometry  is 
defined  in  lines  4a  to  4f;  line  5  terminates  both  the  geometry  and 
the  data-file.  Any  additional  data,  for  example  symmetry  data, 
would  follow  line  5. 


Summarizing,  then,  the  structure  for  a  MOPAC  data-file  is: 


Line  1: 
Line  2: 

Line  3: 
Lines  4: 

Line  5: 


Keywords.  (See  chapter  2  on  definitions  of  keywords) 
Title  of  the  calculation,  e.g.  the  name  of  the 
molecule  or  ion. 

Other  information  describing  the  calculation. 
Internal  or  cartesian  coordinates  (See  chapter  on 
specification  of  geometry) 

Blank  line  to  terminate  the  geometry  definition. 


Other  layouts  for  data-files  involve  additions  to  the  simple 
layout.  These  additions  occur  at  the  end  of  the  data-file,  after 
line  5.  The  three  most  common  additions  are: 


(a)  Symmetry  data:  This  follows  the  geometric  data,  and  is 
ended  by  a  blank  line. 
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(b)  Reaction  path:  After  all  geometry  and  symmetry  data  (if 
any)  are  read  in,  points  on  the  reaction  coordinate  are  defined. 

(c)  Saddle  data:  A  complete  second  geometry  is  input.  The 
second  geometry  follows  the  first  geometry  and  symmetry  data  (if 
any) . 


1.6.2  Example  Of  Data  For  Polytetrahydrofuran 

The  following  example  illustrates  the  data  file  for  a  four  hour 
polytetrahydrofuran  calculation.  As  you  can  see  the  layout  of  the 
data  is  almost  the  same  as  that  for  a  molecule,  the  main  difference 
is  in  the  presence  of  the  translation  vector  atom  "Tv". 


Line  1  :T=4H 

Line  2  :  POLY-TETRAHYDROFURAN  (C4  H8  0)2 

Line  3  : 


Line 

4a 

C 

0.000000 

0 

0.000000 

0 

0.000000 

0 

0 

0 

0 

Line 

4b 

C 

1.551261 

1 

0.000000 

0 

0.000000 

0 

1 

0 

0 

Line 

4c 

0 

1.401861 

1 

108.919034 

1 

0.000000 

0 

2 

1 

0 

Line 

4d 

C 

1.401958 

1 

119.302489 

1 

-179.392581 

1 

3 

2 

1 

Line 

4e 

C 

1.551074 

1 

108.956238 

1 

179.014664 

1 

4 

3 

2 

Line 

4f 

C 

1.541928 

1 

113.074843 

1 

179.724877 

1 

5 

4 

3 

Line 

4g 

C 

1.551502 

1 

113.039652 

1 

179.525806 

1 

6 

5 

4 

Line 

4h 

0 

1.402677 

1 

108.663575 

1 

179.855864 

1 

7 

6 

5 

Line 

4i 

C 

1.402671 

1 

119.250433 

1 

-179.637345 

1 

8 

7 

6 

Line 

4j 

C 

1.552020 

1 

108.665746 

1 

-179.161900 

1 

9 

8 

7 

Line 

4k 

XX 

1.552507 

1 

112.659354 

1 

-178.914985 

1 

10 

9 

8 

Line 

41 

XX 

1.547723 

1 

113.375266 

1 

-179.924995 

1 

11 

10 

9 

Line 

4m 

H 

1.114250 

1 

89.824605 

1 

126.911018 

1 

1 

3 

2 

Line 

4n 

H 

1.114708 

1 

89.909148 

1 

-126.650667 

1 

1 

3 

2 

Line 

4o 

H 

1.123297 

1 

93.602831 

1 

127.182594 

1 

2 

4 

3 

Line 

4p 

H 

1.123640 

1 

93.853406 

1 

-126.320187 

1 

2 

4 

3 

Line 

4q 

H 

1.123549 

1 

90.682924 

1 

126.763659 

1 

4 

6 

5 

Line 

4r 

H 

1.123417 

1 

90.679889 

1 

-127.033695 

1 

4 

6 

5 

Line 

4s 

H 

1.114352 

1 

90.239157 

1 

126.447043 

1 

5 

7 

6 

Line 

4t 

H 

1.114462 

1 

89.842852 

1 

-127.140168 

1 

5 

7 

6 

Line 

4u 

H 

1.114340 

1 

89.831790 

1 

126.653999 

1 

6 

8 

7 

Line 

4v 

H 

1.114433 

1 

89.753913 

1 

-126.926618 

1 

6 

8 

7 

Line 

4w 

H 

1.123126 

1 

93.644744 

1 

127.030541 

1 

7 

9 

8 

Line 

4x 

H 

1.123225 

1 

93.880969 

1 

-126.380511 

1 

7 

9 

8 

Line 

4y 

H 

1.123328 

1 

90.261019 

1 

127.815464 

1 

9 

11 

10 

Line 

4z 

H 

1.123227 

1 

91.051403 

1 

-125.914234 

1 

9 

11 

10 

Line 

4A 

H 

1.113970 

1 

90.374545 

1 

126.799259 

1 

10 

12 

11 

Line 

4B 

H 

1.114347 

1 

90.255788 

1 

-126.709810 

1 

10 

12 

11 

Line 

4C 

Tv 

12.299490 

1 

0.000000 

0 

0.000000 

0 

1 

11 

10 

Line 

5 

0 

0.000000 

0 

0.000000 

0 

0.000000 

0 

0 

0 

0 

Polytetrahydrofuran  has  a  repeat  unit  of  (C4  H8  0)2;  i.e., 
twice  the  monomer  unit.  This  is  necessary  in  order  to  allow  the 
lattice  to  repeat  after  a  translation  through  12.3  Angstroms.  See 
the  section  on  Solid  State  Capability  for  further  details. 
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Note  the  two  dummy  atoms  on  lines  4k  and  41.  These  are  useful, 
but  not  essential,  for  defining  the  geometry.  The  atoms  on  lines  4y 
to  4B  use  these  dummy  atoms,  as  does  the  translation  vector  on  line 
4C.  The  translation  vector  has  only  the  length  marked  for 
optimization.  The  reason  for  this  is  also  explained  in  the 
Background  chapter. 
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CHAPTER  2 


KEYWORDS 


2.1  SPECIFICATION  OF  KEYWORDS 

All  control  data  are  entered  in  the  form  of  keywords,  which  form  the 
first  line  of  a  data-file.  A  description  of  what  each  keyword  does  is 
given  in  Section  2-3.  The  order  in  which  keywords  appear  is  not 
important  although  they  must  be  separated  by  a  space.  Some  keywords  can 
be  abbreviated,  allowed  abbreviations  are  noted  in  Section  2-3  (for 
example  1ELECTR0N  can  be  entered  as  1 ELECT) .  However  the  full  keyword  is 
preferred  in  order  to  more  clearly  document  the  calculation  and  to 
obviate  the  possibility  that  an  abbreviated  keyword  might  not  be 
recognized.  If  there  is  insufficient  space  in  the  first  line  for  all  the 
keywords  needed,  then  consider  abbreviating  the  longer  words.  One  type 
of  keyword,  those  with  an  equal  sign,  such  as,  BAR=0.05,  may  not  be 
abbreviated,  and  the  full  word  needs  to  be  supplied. 

Most  keywords  which  involve  an  equal  sign,  such  as  SCFCRT=1.D-12 
can,  at  the  user's  discretion,  be  written  with  spaces  before  and  after 
the  equal  sign.  Thus  all  permutations  of  SCFCRT= l.D-12,  such  as 
SCFCRT  =1 .D-12 ,  SCFCRT  =  l.D-12,  SCFCRT=  l.D-12,  SCFCRT  =  l.D-12,  etc. 
are  allowed.  Exceptions  to  this  are  T=,  T-PRIORITY=,  H-PRIORITY=, 
X-PRIORITY=,  IRC= ,  DRC=  and  TRANS=.  '  T= '  cannot  be  abbreviated  to  ’  T  ' 
as  many  keywords  start  or  end  with  a  'T';  for  the  other  keywords  the 
associated  abbreviated  keywords  have  specific  meanings. 

If  two  keywords  which  are  incompatible,  like  UHF  and  C.I.,  are 
supplied,  or  a  keyword  which  is  incompatible  with  the  species  supplied, 
for  instance  TRIPLET  and  a  methyl  radical,  then  error  trapping  will 
normally  occur,  and  an  error  message  will  be  printed.  This  usually  takes 
an  insignificant  time,  so  data  are  quickly  checked  for  obvious  errors. 
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2.2  FULL  LIST 


& 

+ 

OSCF 

1ELECTRON- 

1SCF 

AIDER 

AIGIN 

AIGOUT 

ANALYT 

AMI 

BAR=n . n  - 
BIRADICAL- 
BONDS 
C.I. 

CHARGE =n  - 

COMPFG 

CONNOLLY  - 

DEBUG 

DENOUT 

DENSITY  - 

DEP 

DEPVAR=n  - 
DERIV 
DFORCE 
DFP 

DIPOLE 

DIPX 

DIPY 

DIPZ 

DMAX 

DOUBLET  - 
DRC 

DUMP=n 

ECHO 

EF 

EIGINV 

EIGS 

ENPART 

ESP 

ESPRST 

ESR 

EXCITED  - 
EXTERNAL  - 
FILL=n 

FLEPO 

FMAT 

FOCK 

FORCE 

GEO-OK 


OF  KEYWORDS  USED  IN  MOPAC 


TURN  NEXT  LINE  INTO  KEYWORDS 

ADD  ANOTHER  LINE  OF  KEYWORDS 

READ  IN  DATA,  THEN  STOP 

PRINT  FINAL  ONE-ELECTRON  MATRIX 

DO  ONE  SCF  AND  THEN  STOP 

READ  IN  AB  INITIO  DERIVATIVES 

GEOMETRY  MUST  BE  IN  GAUSSIAN  FORMAT 

IN  ARC  FILE,  INCLUDE  AB-INITIO  GEOMETRY 

USE  ANALYTICAL  DERIVATIVES  OF  ENERGY  W.R.T.  GEOMETRY 

USE  THE  AMI  HAMILTONIAN 

REDUCE  BAR  LENGTH  BY  A  MAXIMUM  OF  run 

SYSTEM  HAS  TWO  UNPAIRED  ELECTRONS 

PRINT  FINAL  BOND-ORDER  MATRIX 

A  MULTI-ELECTRON  CONFIGURATION  INTERACTION  SPECIFIED 
CHARGE  ON  SYSTEM  =  n  (e.g.  NH4  =>  CHARGE=1 ) 

PRINT  HEAT  OF  FORMATION  CALCULATED  IN  COMPFG 

USE  CONNOLLY  SURFACE 

DEBUG  OPTION  TURNED  ON 

DENSITY  MATRIX  OUTPUT  (CHANNEL  10) 

PRINT  FINAL  DENSITY  MATRIX 

GENERATE  FORTRAN  CODE  FOR  PARAMETERS  FOR  NEW  ELEMENTS 
TRANSLATION  VECTOR  IS  A  MULTIPLE  OF  BOND-LENGTH 
PRINT  PART  OF  WORKING  IN  DERIV 

FORCE  CALCULATION  SPECIFIED,  ALSO  PRINT  FORCE  MATRIX. 

USE  DAVIDON-FLETCHER-POWELL  METHOD  TO  OPTIMIZE  GEOMETRIES 

FIT  THE  ESP  TO  THE  CALCULATED  DIPOLE 

X  COMPONENT  OF  DIPOLE  TO  BE  FITTED 

Y  COMPONENT  OF  DIPOLE  TO  BE  FITTED 

Z  COMPONENT  OF  DIPOLE  TO  BE  FITTED 

MAXIMUM  STEPSIZE  IN  EIGENVECTOR  FOLLOWING 

DOUBLET  STATE  REQUIRED 

DYNAMIC  REACTION  COORDINATE  CALCULATION 
WRITE  RESTART  FILES  EVERY  n  SECONDS 
DATA  ARE  ECHOED  BACK  BEFORE  CALCULATION  STARTS 
USE  EF  ROUTINE  FOR  MINIMUM  SEARCH 

PRINT  ALL  EIGENVALUES  IN  ITER 
PARTITION  ENERGY  INTO  COMPONENTS 
ELECTROSTATIC  POTENTIAL  CALCULATION 
RESTART  OF  ELECTROSTATIC  POTENTIAL 
CALCULATE  RHF  UNPAIRED  SPIN  DENSITY 
OPTIMIZE  FIRST  EXCITED  SINGLET  STATE 
READ  PARAMETERS  OFF  DISK 

IN  RHF  OPEN  AND  CLOSED  SHELL,  FORCE  M.O.  n 
TO  BE  FILLED 

PRINT  DETAILS  OF  GEOMETRY  OPTIMIZATION 
PRINT  DETAILS  OF  WORKING  IN  FMAT 
PRINT  LAST  FOCK  MATRIX 
FORCE  CALCULATION  SPECIFIED 
OVERRIDE  INTERATOMIC  DISTANCE  CHECK 
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GNORM-n . n- 

GRADIENTS- 

GRAPH 

HCORE 

HESS=N 

H-PRIO 

HYPERFINE- 

IRC 

ISOTOPE  - 
ITER 
ITRY=N 
IUPD 

K= (N,N)  - 

KINETIC  - 
LINMIN 
LARGE 
LET 

LOCALIZE  - 
MAX 
MECI 
MICROS 
MINDO/3  - 
MMOK 
MODE-N 
MOLDAT 
MS=N 
MULLIK 
NLLSQ 
NOANCI 
NODUS 
NOINTER  - 
NOLOG 
NOMM 
NONR 

NOTH I EL  - 
NSURF=N 
NOXYZ 
NSURF 
OLDENS 
OLDGEO 
OPEN 
ORIDE 
PARASOK  - 
PI 
PL 
PM3 

POINT=N 
POINTl=N 
POINT2=N 
POLAR 
POTWRT 
POWSQ 
PRECISE 
POLAY 
QUARTET 


EXIT  WHEN  GRADIENT  NORM  DROPS  BELOW  n.n 

PRINT  ALL  GRADIENTS 

GENERATE  FILE  FOR  GRAPHICS 

PRINT  DETAILS  OF  WORKING  IN  HCORE 

OPTIONS  FOR  CALCULATING  HESSIAN  MATRICES  IN  EF 

HEAT  OF  FORMATION  TAKES  PRIORITY  IN  DRC 

HYPERFINE  COUPLING  CONSTANTS  TO  BE  CALCULATED 

INTRINSIC  REACTION  COORDINATE  CALCULATION 

FORCE  MATRIX  WRITTEN  TO  DISK  (CHANNEL  9  ) 

PRINT  DETAILS  OF  WORKING  IN  ITER 

SET  LIMIT  OF  NUMBER  OF  SCF  ITERATIONS  TO  N. 

MODE  OF  HESSIAN  UPDATE  IN  EIGENVECTOR  FOLLOWING 

BRILLOUIN  ZONE  STRUCTURE  TO  BE  CALCULATED 

EXCESS  KINETIC  ENERGY  ADDED  TO  DRC  CALCULATION 

PRINT  DETAILS  OF  LINE  MINIMIZATION 

PRINT  EXPANDED  OUTPUT 

OVERRIDE  CERTAIN  SAFETY  CHECKS 

PRINT  LOCALIZED  ORBITALS 

PRINTS  MAXIMUM  GRID  SIZE  (23*23) 

PRINT  DETAILS  OF  MECI  CALCULATION 
USE  SPECIFIC  MICROSTATES  IN  THE  C.I. 

USE  THE  MINDO/3  HAMILTONIAN 

USE  MOLECULAR  MECHANICS  CORRECTION  TO  CONH  BONDS 

IN  EF,  FOLLOW  HESSIAN  MODE  NO.  N 

PRINT  DETAILS  OF  WORKING  IN  MOLDAT 

IN  MECI,  MAGNETIC  COMPONENT  OF  SPIN 

PRINT  THE  MULLIKEN  POPULATION  ANALYSIS 

MINIMIZE  GRADIENTS  USING  NLLSQ 

DO  NOT  USE  ANALYTICAL  C.I.  DERIVATIVES 

DO  NOT  USE  DIIS  GEOMETRY  OPTIMIZER 

DO  NOT  PRINT  INTERATOMIC  DISTANCES 

SUPPRESS  LOG  FILE  TRAIL,  WHERE  POSSIBLE 

DO  NOT  USE  MOLECULAR  MECHANICS  CORRECTION  TO  CONH  BONDS 

DO  NOT  USE  THIEL'S  FSTMIN  TECHNIQUE 

NUMBER  OF  SURFACES  IN  AN  ESP  CALCULATION 

DO  NOT  PRINT  CARTESIAN  COORDINATES 

NUMBER  OF  LAYERS  USED  IN  ELECTROSTATIC  POTENTIAL 

READ  INITIAL  DENSITY  MATRIX  OFF  DISK 

PREVIOUS  GEOMETRY  TO  BE  USED 

OPEN-SHELL  RHF  CALCULATION  REQUESTED 

IN  AMI  CALCULATIONS  SOME  MNDO  PARAMETERS  ARE  TO  BE  USED 
RESOLVE  DENSITY  MATRIX  INTO  SIGMA  AND  PI  BONDS 
MONITOR  CONVERGENCE  OF  DENSITY  MATRIX  IN  IT**?. 

USE  THE  MNDO-PM3  HAMILTONIAN 
NUMBER  OF  POINTS  IN  REACTION  PATH 

NUMBER  OF  POINTS  IN  FIRST  DIRECTION  IN  GRID  CALCULATION 
NUMBER  OF  POINTS  IN  SECOND  DIRECTION  IN  GRID  CALCULATION 
CALCULATE  FIRST,  SECOND  AND  THIRD  ORDER  POLARIZABILITIES 
IN  ESP,  WRITE  OUT  ELECTROSTATIC  POTENTIAL  TO  UNIT  21 
PRINT  DETAILS  OF  WORKING  IN  POWSQ 
CRITERIA  TO  BE  INCREASED  BY  100  TIMES 
USE  PULAY'S  CONVERGER  TO  OBTAIN  A  SCF 
QUARTET  STATE  REQUIRED 


12 


KEYWORDS 


Page  2-4 


QUINTET 

RECALC =N 

RESTART 

ROOT=n 

ROT=n 

SADDLE 

SCALE 

SCFCRT=n 

SCINCR 

SETUP 

SEXTET 

SHIFT=n 

SIGMA 

SINGLET 

SLOPE 

SPIN 

STEP 

STEPl=n 

STEP2=n 

STO-3G 

SYMAVG 

SYMMETRY 

T=n 

THERMO 

TIMES 

T-PRIO 

TRANS 

TRIPLET 

TS 

UHF 

VECTORS 

VELOCITY 

WILLIAMS 

X-PRIO 

XYZ 


QUINTET  STATE  REQUIRED 

IN  EF,  RECALCULATE  HESSIAN  EVERY  N  STEPS 
CALCULATION  RESTARTED 

ROOT  n  TO  BE  OPTIMIZED  IN  A  C.I.  CALCULATION 
THE  SYMMETRY  NUMBER  OF  THE  SYSTEM  IS  n. 

OPTIMIZE  TRANSITION  STATE 

SCALING  FACTOR  FOR  VAN  DER  WAALS  DISTANCE  IN  ESP 
DEFAULT  SCF  CRITERION  REPLACED  BY  THE  VALUE  SUPPLIED 
INCREMENT  BETWEEN  LAYERS  IN  ESP 
EXTRA  KEYWORDS  TO  BE  READ  OF  SETUP  FILE 
SEXTET  STATE  REQUIRED 

A  DAMPING  FACTOR  OF  n  DEFINED  TO  START  SCF 

MINIMIZE  GRADIENTS  USING  SIGMA 

SINGLET  STATE  REQUIRED 

MULTIPLIER  USED  TO  SCALE  MNDO  CHARGES 

PRINT  FINAL  UHF  SPIN  MATRIX 

STEP  SIZE  IN  PATH 

STEP  SIZE  n  FOR  FIRST  COORDINATE  IN  GRID  CALCULATION 

STEP  SIZE  n  FOR  SECOND  COORDINATE  IN  GRID  CALCULATION 

DEORTHOGONALIZE  ORBITALS  IN  STO-3G  BASIS 

AVERAGE  SYMMETRY  EQUIVALENT  ESP  CHARGES 

IMPOSE  SYMMETRY  CONDITIONS 

A  TIME  OF  n  SECONDS  REQUESTED 

PERFORM  A  THERMODYNAMICS  CALCULATION 

PRINT  TIMES  OF  VARIOUS  STAGES 

TIME  TAKES  PRIORITY  IN  DRC 

THE  SYSTEM  IS  A  TRANSITION  STATE 

(USED  IN  THERMODYNAMICS  CALCULATION) 

TRIPLET  STATE  REQUIRED 
USING  EF  ROUTINE  FOR  TS  SEARCH 
UNRESTRICTED  HARTREE-FOCK  CALCULATION 
PRINT  FINAL  EIGENVECTORS 

SUPPLY  THE  INITIAL  VELOCITY  VECTOR  IN  A  DRC  CALCULATION 
USE  WILLIAMS  SURFACE 

GEOMETRY  CHANGES  TAKE  PRIORITY  IN  DRC 

DO  ALL  GEOMETRIC  OPERATIONS  IN  CARTESIAN  COORDINATES. 
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2.3  DEFINITIONS  OF  KEYWORDS 

The  definitions  below  are  given  with  some  technical  expressions 
which  are  not  further  defined.  Interested  users  are  referred  to  Appendix 
E  of  this  manual  to  locate  appropriate  references  which  will  provide 
further  clarification. 

There  are  three  classes  of  keywords:  (1)  those  which  CONTROL 
substantial  aspects  of  the  calculation,  i.e.,  those  which  affect  the 
final  heat  of  formation,  (2)  those  which  determine  which  OUTPUT  will  be 
calculated  and  printed,  and  (3)  those  which  dictate  the  WORKING  of  the 
calculation,  but  which  do  not  affect  the  heat  of  formation.  The 
assignment  to  one  of  these  classes  is  designated  by  a  (C),  (O)  or  (W), 
respectively,  following  each  keyword  in  the  list  below. 


&  <C) 

An  1  & '  means  'turn  the  next  line  into  keywords'.  Note  the  space 
before  the  'J_  sign.  Since  is  a  keyword,  it  must  be  preceeded  by  a 
space.  A  '  & '  on  line  1  would  mean  that  a  second  line  of  keywords  should 
be  read  in.  If  that  second  line  contained  a  '  &',  then  a  third  line  of 
keywords  would  be  read  in.  If  the  first  line  has  a  '  &'  then  the  first 
description  line  is  omitted,  if  the  second  line  has  a  '  &',  then  both 
description  lines  are  omitted. 

Examples 
Use  of  one  ' & ' 

VECTORS  DENSITY  RESTART  &  NLLSQ  T=1H  SCFCRT=1.D~8  DUMP=30M  ITRY=300 
PM3  FOCK  OPEN (2,2)  ROOT=3  SINGLET  SHIFT=30 
Test  on  a  totally  weird  system 

Use  of  two  ' & ' s 

LARGE=-10  &  DRC=4 . 0  T=1H  SCFCRT=l.D-8  DUMP=30M  ITRY=300  SHIFT=30 
PM3  OPEN (2,2)  R00T=3  SINGLET  NOANCI  ANALYT  T-PRIORITY=0 . 5  & 

LET  GEO-OK  VELOCITY  KINETIC=5.0 


+  (C) 

A  '  +'  sign  means  'read  another  line  of  keywords'.  Note  the  space 
before  the  '+'  sign.  Since  '+'  is  a  keyword,  it  must  be  preceeded  by  a 
space.  A  '  +'  on  line  1  would  mean  that  a  second  line  of  keywords  should 
be  read  in.  If  that  second  line  contains  a  '  then  a  third  line  of 
keywords  will  be  read  in.  Regardless  of  whether  a  second  or  a  third  line 
of  keywords  is  read  in,  the  next  two  lines  would  be  description  lines. 
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Example  of  '  +'  option 

RESTART  T=4D  FORCE  OPEN(2,2)  SHIFT=20  PM3  + 

SCFCRT=1 . D-8  DEBUG  +  ISOTOPE  FMAT  ECHO  singlet  ROOT=3 
THERMO (300 ,400,1)  ROT=3 

Example  of  data  set  with  three  lines  of  keywords.  NOTE:  There 
are  two  lines  of  description,  this  and  the  previous  line. 


OSCF  (0) 

The  data  can  be  read  in  and  output,  but  no  actual  calculation  is 
performed  when  this  keyword  is  used.  This  is  useful  as  a  check  on  the 
input  data.  All  obvious  errors  are  trapped,  and  warning  messages 
printed. 

A  second  use  is  to  convert  from  one  format  to  another.  The  input 
geometry  is  printed  in  various  formats  at  the  end  of  a  OSCF  calculation. 
If  NOINTER  is  absent,  cartesian  coordinates  are  printed. 
Unconditionally,  MOPAC  Z-matrix  internal  coordinates  are  printed,  and  if 
AIGOUT  is  present,  Gaussian  Z-matrix  internal  coordinates  are  printed. 
OSCF  should  now  be  used  in  place  of  DDUM. 


1ELECTRON  (0) 

The  final  one-electron  matrix  is  printed  out.  This  matrix  is 
composed  of  atomic  orbitals;  the  array  element  between  orbitals  i  and  j 
on  different  atoms  is  given  by 

H ( i , j )  =  0.5  x  (beta(i)  +beta(j))  x  overlap(i,j) 

The  matrix  elements  between  orbitals  i  and  j  on  the  same  atom  are 
calculated  from  the  electron-nuclear  attraction  energy,  and  also  from  the 
U(i)  value  if  i=j. 

The  one-electron  matrix  is  unaffected  by  (a)  the  charge  and  (b)  the 
electron  density.  It  is  only  a  function  of  the  geometry.  Abbreviation: 
1ELEC. 


1SCF  (C) 

When  users  want  to  examine  the  results  of  a  single  SCF  calculation 
of  a  geometry,  1SCF  should  be  used.  1SCF  can  be  used  in  conjunction  with 
RESTART,  in  which  case  a  single  SCF  calculation  will  be  done,  and  the 
results  printed. 

When  1SCF  is  used  on  its  own  (that  is,  RESTART  is  not  also  used) 
then  derivatives  will  only  be  calculated  if  GRAD  is  also  specified. 

1SCF  is  helpful  in  a  learning  situation.  MOPAC  normally  performs 
many  SCF  calculations,  and  in  order  to  minimize  output  when  following  the 
working  of  the  SCF  calculation,  1SCF  is  very  useful. 
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AIDER  (C) 

AIDER  allows  MOPAC  to  optimize  an  ab-initio  geometry.  To  use  it, 
calculate  the  ab-initio  gradients  using,  e.g.,  Gaussian.  Supply  MOPAC 
with  these  gradients,  after  converting  them  into  kcal/mol.  The  geometry 
resulting  from  a  MOPAC  run  will  be  nearer  to  the  optimized  ab-initio 
geometry  than  if  the  geometry  optimizer  in  Gaussian  had  been  used. 


AIGIN  (C) 

If  the  geometry  (Z-matrix)  is  specified  using  the  Gaussian-8X,  then 
normally  this  will  be  read  in  without  difficulty.  In  the  event  that  it 
is  mistaken  for  a  normal  MOPAC-type  Z-matrix,  the  keyword  AIGIN  is 
provided.  AIGIN  will  force  the  data-set  to  be  read  in  assuming  Gaussian 
format.  This  is  necessary  if  more  than  one  system  is  being  studied  in 
one  run. 


AIGOUT  (0) 

The  ARCHIVE  file  contains  a  data-set  suitable  for  submission  to 
MOPAC.  If,  in  addition  to  this  data-set,  the  Z-matrix  for  Gaussian  input 
is  wanted,  then  AIGOUT  (ab  initio  geometry  output),  should  be  used. 

The  Z-matrix  is  in  full  Gaussian  form.  Symmetry,  where  present, 
will  be  correctly  defined.  Names  of  symbolics  will  be  those  used  if  the 
original  geometry  was  in  Gaussian  format,  otherwise  'logical'  names  will 
be  used.  Logical  names  are  of  form  <t><a><b>[<c> ] [<d>]  where  <t>  is  'r' 
for  bond  length,  'a'  for  angle,  or  'd'  for  dihedral,  <a>  is  the  atom 
number,  <b>  is  the  atom  to  which  <a>  is  related,  <c>,  if  present,  is  the 
atom  number  to  which  <a>  makes  an  angle,  and  <d>,  if  present,  is  the  atom 
number  to  which  <a>  makes  a  dihedral. 


ANALYT  (W) 

By  default,  finite  difference  derivatives  of  energy  with  respect  to 
geometry  are  used.  If  ANALYT  is  specified,  then  analytical  derivatives 
are  used  instead.  Since  the  analytical  derivatives  are  over  Gaussian 
functions  —  a  ST0-6G  basis  set  is  used  —  the  overlaps  are  also  over 
Gaussian  functions.  This  will  result  in  a  very  small  (less  than  0.1 
Kcal/mole)  change  in  heat  of  formation.  Use  analytical  derivatives  (a) 
when  the  mantissa  used  is  less  than  about  51-53  bits,  or  (b)  when 
comparison  with  finite  difference  is  desired.  Finite  difference 
derivatives  are  still  used  when  non-var iationally  optimized  wavefunctions 
are  present. 


AMI  (C) 

The  AMI  method  is  to  be  used.  By  default  MNDO  is  run. 
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BAR=n.nn  (W) 

In  the  SADDLE  calculation  the  distance  between  the  two  geometries  is 
steadily  reduced  until  the  transition  state  is  located.  Sometimes, 
however,  the  user  may  want  to  alter  the  maximum  rate  at  which  the 
distance  between  the  two  geometries  reduces.  BAR  is  a  ratio,  normally 
0.15,  or  15  percent.  This  represents  a  maximum  rate  of  reduction  of  the 
bar  of  15  percent  per  step.  Alternative  values  that  might  be  considered 
are  BAR=0.05  or  BAR=0.10,  although  other  values  may  be  used.  See  also 
SADDLE. 

If  CPU  time  is  not  a  major  consideration,  use  BAR=0.03. 


BIRADICAL  (C) 

NOTE:  BIRADICAL  is  a  redundant  keyword,  and  represents  a  particular 
configuration  interaction  calculation.  Experienced  users  of  MECI  (q.v.) 
can  duplicate  the  effect  of  the  keyword  BIRADICAL  by  using  the  MECI 
keywords  OPEN (2, 2)  and  SINGLET. 

For  molecules  which  are  believed  to  have  biradicaloid  character  the 
option  exists  to  optimize  the  lowest  singlet  energy  state  which  results 
from  the  mixing  of  three  states.  These  states  are,  in  order,  (1)  the 
(micro)state  arising  from  a  one  electron  excitation  from  the  HOMO  to  the 
LUMO,  which  is  combined  with  the  microstate  resulting  from  the 
time-reversal  operator  acting  on  the  parent  microstate,  the  result  being 
a  full  singlet  state;  (2)  the  state  resulting  from  de-excitation  from  the 
formal  LUMO  to  the  HOMO;  and  (3)  the  state  resulting  from  the  single 
electron  in  the  formal  HOMO  being  excited  into  the  LUMO. 

Microstate  1  Microstate  2  Microstate  3 


Alpha  Beta  Alpha  Beta  Alpha  Beta  Alpha  Beta 


LUMO 


* 


* 


* 


+ 


HOMO 


* 


* 


*  * 


A  configuration  interaction  calculation  is  involved  here.  A  biradical 
calculation  done  without  C.I.  at  the  RHF  level  would  be  meaningless. 
Either  rotational  invariance  would  be  lost,  as  in  the  D2d  form  of 
ethylene,  or  very  artificial  barriers  to  rotations  would  be  found,  such 
as  in  a  methane  molecule  "orbiting"  a  D2d  ethylene.  In  both  cases  the 
inclusion  of  limited  configuration  interaction  corrects  the  error. 
BIRADICAL  should  not  be  used  if  either  the  HOMO  or  LUMO  is  degenerate;  in 
this  case,  the  full  manifold  of  HOMO  x  LUMO  should  be  included  in  the 
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C.I.,  using  MECI  options.  The  user  should  be  aware  of  this  situation. 
When  the  biradical  calculation  is  performed  correctly,  the  result  is 
normally  a  net  stabilization.  However,  if  the  first  singlet  excited 
state  is  much  higher  in  energy  than  the  closed-shell  ground  state, 
BIRADICAL  can  lead  to  a  destabilization.  Abbreviation:  BIRAD.  See  also 
MECI,  C.I.,  OPEN,  SINGLET. 


BONDS  (0) 

The  rotationally  invariant  bond  order  between  all  pairs  of  atoms  is 
printed.  In  this  context  a  bond  is  defined  as  the  sum  of  the  squares  of 
the  density  matrix  elements  connecting  any  two  atoms.  For  ethane, 
ethylene,  and  acetylene  the  carbon-carbon  bond  orders  are  roughly  1.00, 
2.00,  and  3.00  respectively.  The  diagonal  terms  are  the  valencies 
calculated  from  the  atomic  terms  only  and  are  defined  as  the  sum  of  the 
bonds  the  atom  makes  with  other  atoms.  In  UHF  and  non-variationally 
optimized  wavef unctions  the  calculated  valency  will  be  incorrect,  the 
degree  of  error  being  proportional  to  the  non-duodempotency  of  the 
density  matrix.  For  an  RHF  wavefunction  the  square  of  the  density  matrix 
is  equal  to  twice  the  density  matrix. 

The  bonding  contributions  of  all  M.O.'s  in  the  system  are  printed 
immediately  before  the  bonds  matrix.  The  idea  of  molecular  orbital 
valency  was  developed  by  Gopinathan,  Siddarth,  and  Ravimohan.  Just  as  an 
atomic  orbital  has  a  'valency',  so  has  a  molecular  orbital.  This  leads 
to  the  following  relations:  The  sum  of  the  bonding  contributions  of  all 
occupied  M.O.'s  is  the  same  as  the  sum  of  all  valencies  which,  in  turn  is 
equal  to  two  times  the  sum  of  all  bonds.  The  sum  of  the  bonding 
contributions  of  all  M.O.'s  is  zero. 


C. I . =n  (C) 

Normally  configuration  interaction  is  invoked  if  any  of  the  keywords 
which  imply  a  C.I.  calculation  are  used,  such  as  BIRADICAL,  TRIPLET  or 
QUARTET.  Note  that  ROOT=  does  not  imply  a  C.I.  calculation:  ROOT=  is 
only  used  when  a  C.I.  calculation  is  done.  However,  as  these  implied 
C.I.'s  involve  the  minimum  number  of  configurations  practical,  the  user 
may  want  to  define  a  larger  than  minimum  C.I.,  in  which  case  the  keyword 
C.I.=n  can  be  used.  When  C.I.=n  is  specified,  the  n  M.O.'s  which 
"bracket"  the  occupied-  virtual  energy  levels  will  be  used.  Thus,  C.I. =2 
will  include  both  the  HOMO  and  the  LUMO,  while  C.I.=1  (implied  for 
odd-electron  systems)  will  only  include  the  HOMO  (This  will  do  nothing 
for  a  closed-shell  system,  and  leads  to  Dewar's  half-electron  correction 
for  odd-electron  systems).  Users  should  be  aware  of  the  rapid  increase 
in  the  size  of  the  C.I.  with  increasing  numbers  of  M.O.'s  being  used. 
Numbers  of  microstates  implied  by  the  use  of  the  keyword  C.I.=n  on  its 
own  are  as  follows: 
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Keyword 

Even 

-electron 

systems 

Odd- 

-electron 

systems 

No.  of 
Alpha 

electrons. 

Beta 

conf igs 

No.  of  electrons 
Alpha  Beta 

,  configs 

C.I. si 

1 

1 

1 

1 

0 

1 

C.I. =2 

1 

1 

4 

1 

0 

2 

C.I. =3 

2 

2 

9 

2 

1 

9 

C.I. =4 

2 

2 

36 

2 

1 

24 

C.I. =5 

3 

3 

100 

3 

2 

100 

C.I. =6 

3 

3 

400 

3 

2 

300 

C.I. =7 

4 

4 

1225 

4 

3 

1225 

C. I .=8  (Do  not  use  unless  other  keywords  also  used,  see  below) 

If  a  change  of  spin  is  defined,  then  larger  numbers  of  M.O.'s  can  be 
used  up  to  a  maximum  of  10.  The  C.I.  matrix  is  of  size  100  x  100.  For 
calculations  involving  up  to  100  configurations,  the  spin-states  are 
exact  eigenstates  of  the  spin  operators.  For  systems  with  more  than  100 
configurations,  the  100  configurations  of  lowest  energy  are  used.  See 
also  MICROS  and  the  keywords  defining  spin-states. 

Note  that  for  any  system,  use  of  C.I. =5  or  higher  normally  implies 
the  diagonalization  of  a  100  by  100  matrix.  As  a  geometry  optimization 
using  a  C.I.  requires  the  derivatives  to  be  calculated  using  derivatives 
of  the  C.I.  matrix,  geometry  optimization  with  large  C.I.'s  will  require 
more  time  than  smaller  C.I.'s. 

Associated  keywords:  MECI,  ROOT=,  MICROS,  SINGLET,  DOUBLET,  etc. 


C.I.=(n,m) 

In  addition  to  specifying  the  number  of  M.O.'s  in  the  active  space, 
the  number  of  electrons  can  also  be  defined.  In  C.I.=(n,m),  n  is  the 
number  of  M.O.s  in  the  active  space,  and  m  is  the  number  of  doubly  filled 
levels  to  be  used. 

EXAMPLES 

Keywords  Number  of  M.O.s  No.  Electrons 


C.I. =2  2  2  (1) 

C.I. =(2,1)  2  2  (3) 

C.I. =(3,1)  3  2  (3) 

C.I. =(3,2)  3  4  (5) 

C.I. =(3,0)  OPEN (2,3)  3  2  (N/A) 

C.I. =(3,1)  OPEN (2,2)  3  4  (N/A) 

C.I. =(3,1)  OPEN (1,2)  3  N/A  (3) 


Odd  electron  systems  given  in  parentheses. 
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CHARGE=n  (C) 


When  the  system  being  studied  is  an  ion,  the  charge,  n,  on  the  ion 
must  be  supplied  by  CHARGE=n.  For  cations  n  can  be  1  or  2  or  3,  etc,  for 
anions  -1  or  -2  or  -3,  etc. 

EXAMPLES 


ION 

NH4 ( +) 
C2H5(+) 
S04 ( = ) 
HS04 ( - ) 


KEYWORD 

CHARGE =1 
CHARGED 
CHARGE=-2 
CHARGE=-1 


ION 

CH3COO(~) 
(COO)(=) 
P04( 3-) 
H2P04(-) 


KEYWORD 

CHARGE=-1 

CHARGE=-2 

CHARGE=-3 

CHARGE=-1 


DCART  (O) 

The  cartesian  derivatives  which  are  calculated  in  DCART  for 
variationally  optimized  systems  are  printed  if  the  keyword  DCART  is 
present.  The  derivatives  are  in  units  of  kcals/Angstrom,  and  the 
coordinates  are  displacements  in  x,  y,  and  z. 


DEBUG  (O) 

Certain  keywords  have  specific  output  control  meanings,  such  as 
FOCK,  VECTORS  and  DENSITY,  If  they  are  used,  only  the  final  arrays  of 
the  relevant  type  are  printed.  If  DEBUG  is  supplied,  then  all  arrays  are 
printed.  This  is  useful  in  debugging  ITER.  DEBUG  can  also  increase  the 
amount  of  output  produced  when  certain  output  keywords  are  used,  e.g. 
COMPFG. 


DENOUT  (0) 

The  density  matrix  at  the  end  of  the  calculation  is  to  be  output  in 
a  form  suitable  for  input  in  another  job.  If  an  automatic  dump  due  to 
the  time  being  exceeded  occurs  during  the  current  run  then  DENOUT  is 
invoked  automatically,  (see  RESTART) 


DENSITY  (0) 

At  the  end  of  a  job,  when  the  results  are  being  printed,  the  density 
matrix  is  also  printed.  For  RHF  the  normal  density  matrix  is  printed. 
For  UHF  the  sum  of  the  alpha  and  beta  density  matrices  is  printed. 

If  density  is  not  requested,  then  the  diagonal  of  the  density 
matrix,  i.e.,  the  electron  density  on  the  atomic  orbitals,  will  be 
printed. 
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DEP  (0) 

For  use  only  with  EXTERNAL2 .  When  new  parameters  are  published , 
they  can  be  entered  at  run-time  by  using  EXTERNAL2,  but  as  this  is 
somewhat  clumsy,  a  permanent  change  can  be  made  by  use  of  DEP. 

If  DEP  is  invoked,  a  complete  block  of  FORTRAN  code  will  be 
generated,  and  this  can  be  inserted  directly  into  the  BLOCK  DATA  file. 

Note  that  the  output  is  designed  for  use  with  PM3 .  By  modifying  the 
names,  the  output  can  be  used  with  MNDO  or  AMI. 


DEPVAR=n.nn  (C) 

In  polymers  the  translation  vector  is  frequently  a  multiple  of  some 
internal  distance.  For  example,  in  polythene  it  is  the  C1-C3  distance. 
If  a  cluster  unit  cell  of  C6H12  is  used,  then  symmetry  can  be  used  to  tie 
together  all  the  carbon  atom  coordinates  and  the  translation  vector 
distance.  In  this  example  DEPVAR=3.0  would  be  suitable. 


DFP  (W) 

By  default  the  Broyden-Fletcher-Goldfarb-Shanno  method  will  be  used 
to  optimize  geometries.  The  older  Davidon-Fletcher-Powell  method  can  be 
invoked  by  specifying  DFP.  This  is  intended  to  be  used  for  comparison  of 
the  two  methods. 


DIPOLE  (C) 

Used  in  the  ESP  calculation,  DIPOLE  will  constrain  the  calculated 
charges  to  reproduce  the  cartesian  dipole  moment  components  calculated 
from  the  density  matrix  and  nuclear  charges. 


DIPX  (C) 

Similar  to  DIPOLE,  except  the  fit  will  be  for  the  X-component  only. 


DIPY  (C) 

Similar  to  DIPOLE,  except  the  fit  will  be  for  the  Y-component  only. 

DIPZ  (C) 

Similar  to  DIPOLE,  except  the  fit  will  be  for  the  Z-component  only. 
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DMAX=n.nn  (W) 

In  the  EF  routine,  the  maximum  step-size  is  0.2  (Angstroms  or 
radians),  by  default.  This  can  be  changed  by  specifying  DMAX=n.nn. 
Increasing  DMAX  can  lead  to  faster  convergence  but  can  also  make  the 
optimization  go  bad  very  fast.  Furthermore,  the  Hessian  updating  may 
deteriorate  when  using  large  stepsizes.  Reducing  the  stepsize  to  0.10  or 
0.05  is  recommended  when  encountering  convergence  problems. 


DOUBLET  (C) 

When  a  configuration  interaction  calculation  is  done,  all  spin 
states  are  calculated  simultaneously,  either  for  component  of  spin  =  0  or 
1/2.  When  only  doublet  states  are  of  interest,  then  DOUBLET  can  be 
specified,  and  all  other  spin  states,  while  calculated,  are  ignored  in 
the  choice  of  root  to  be  used. 

Note  that  while  almost  every  odd-electron  system  will  have  a  doublet 
ground  state,  DOUBLET  should  still  be  specified  if  the  desired  state  must 
be  a  doublet. 

DOUBLET  has  no  meaning  in  a  UHF  calculation. 


DRC  (C) 

A  Dynamic  Reaction  Coordinate  calculation  is  to  be  run.  By  default, 
total  energy  is  conserved,  so  that  as  the  "reaction"  proceeds  in  time, 
energy  is  transferred  between  kinetic  and  potential  forms. 


DRC=n.nnn  (C) 

In  a  DRC  calculation,  the  "half-life"  for  loss  of  kinetic  energy  is 
defined  as  n.nnn  femtoseconds.  If  n.nnn  is  set  to  zero,  infinite  damping 
simulating  a  very  condensed  phase  is  obtained. 

This  keyword  cannot  be  written  with  spaces  around  the  '  =  '  sign. 


DUMP  (W) 

Restart  files  are  written  automatically  at  one  hour  cpu  time 
intervals  to  allow  a  long  job  to  be  restarted  if  the  job  is  terminated 
catastrophically.  To  change  the  frequency  of  dump,  set  DUMP-nn  to 
request  a  dump  every  nn  seconds.  Alternative  forms,  DUMP=nnM,  DUMP=nnH, 
DUMP=nnD  for  a  dump  every  nn  minutes,  hours,  or  days,  respectively.  DUMP 
only  works  with  geometry  optimization,  gradient  minimization,  path,  and 
FORCE  calculations.  It  does  not  (yet)  work  with  a  SADDLE  calculation. 
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ECHO  (0) 

Data  are  echoed  back  if  ECHO  is  specified.  Only  useful  if  data  are 
suspected  to  be  corrupt. 


EF  (C) 

The  Eigenvector  Following  routine  is  an  alternative  to  the  BFGS,  and 
appears  to  be  much  faster.  To  invoke  the  Eigenvector  Following  routine, 
specify  EF.  EF  is  particularly  good  in  the  end-game,  when  the  gradient 
is  small.  See  also  HESS,  DMAX,  EIGINV. 


EIGINV  (W) 

Not  recommended  for  normal  use.  Used  with  the  EF  routine.  See 
source  code  for  more  details. 


ENPART  (0) 

This  is  a  very  useful  tool  for  analyzing  the  energy  terms  within  a 
system.  The  total  energy,  in  eV,  obtained  by  the  addition  of  the 
electronic  and  nuclear  terms,  is  partitioned  into  mono-  and  bi-centric 
contributions,  and  these  contributions  in  turn  are  divided  into  nuclear 
and  one-  and  two-electron  terms. 


ESP  (C) 

This  is  the  Electrostatic  Potential  calculation  of  K.  M.  Merz  and 
B.  H.  Besler.  ESP  calculates  the  expectation  values  of  the  electrostatic 
potential  of  a  molecule  on  a  uniform  distribution  of  points.  The 
resultant  ESP  surface  is  then  fitted  to  atom  centered  charges  that  best 
reproduce  the  distribution,  in  a  least  squares  sense. 


ESPRST  (W) 

ESPRST  restarts  a  stopped  ESP  calculation.  Do  not  use  with  RESTART. 
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ESR  (0) 

The  unpaired  spin  density  arising  from  an  odd-electron  system  can  be 
calculated  both  RHF  and  UHF.  In  a  UHF  calculation  the  alpha  and  beta 
M.O.'s  have  different  spatial  forms,  so  unpaired  spin  density  can 
naturally  be  present  on  in-plane  hydrogen  atoms  such  as  in  the  phenoxy 
radical. 

In  the  RHF  formalism  a  MECI  calculation  is  performed.  If  the 
keywords  OPEN  and  C.I.®  are  both  absent  then  only  a  single  state  is 
calculated.  The  unpaired  spin  density  is  then  calculated  from  the  state 
function.  In  order  to  have  unpaired  spin  density  on  the  hydrogens  in, 
for  example,  the  phenoxy  radical,  several  states  should  be  mixed. 


EXCITED  (C) 

The  state  to  be  calculated  is  the  first  excited  open-shell  singlet 
state.  If  the  ground  state  is  a  singlet,  then  the  state  calculated  will 
be  S(l);  if  the  ground  state  is  a  triplet,  then  S(2).  This  state  would 
normally  be  the  state  resulting  from  a  one-electron  excitation  from  the 
HOMO  to  the  LUMO.  Exceptions  would  be  if  the  lowest  singlet  state  were  a 
biradical,  in  which  case  the  EXCITED  state  could  be  a  closed  shell. 

The  EXCITED  state  will  be  calculated  from  a  BIRADICAL  calculation  in 
which  the  second  root  of  the  C.I.  matrix  is  selected.  Note  that  the 
eigenvector  of  the  C.I.  matrix  is  not  used  in  the  current  formalism. 
Abbreviation:  EXCI. 

NOTE:  EXCITED  is  a  redundant  keyword,  and  represents  a  particular 
configuration  interaction  calculation.  Experienced  users  of  MECI  can 
duplicate  the  effect  of  the  keyword  EXCITED  by  using  the  MECI  keywords 
OPEN (2,2),  SINGLET,  and  ROOT=2. 


EXTERNAL=name  (C) 

Normally,  PM3,  AMI  and  MNDO  parameters  are  taken  from  the  BLOCK  DATA 
files  within  MOPAC.  When  the  supplied  parameters  are  not  suitable,  as  in 
an  element  recently  parameterized,  and  the  parameters  have  not  yet 
installed  in  the  user's  copy  of  MOPAC,  then  the  new  parameters  can  be 
inserted  at  run  time  by  use  of  EXTERNAL=<f ilename>,  where  <filename>  is 
the  name  of  the  file  which  contains  the  new  parameters. 

<filename>  consists  of  a  series  of  parameter  definitions  in  the 
format 

<Parameter>  <Element>  <Value  of  parameter> 

where  the  possible  parameters  are  USS,  UPP,  UDD,  ZS,  ZP,  ZD,  BETAS, 
BETAP,  BETAD,  GSS,  GSP,  GPP,  GP2,  HSP,  ALP,  FNnm,  n=l,2,  or  3,  and  m=l  to 
10,  and  the  elements  are  defined  by  their  chemical  symbols,  such  as  Si  or 
SI. 
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When  new  parameters  for  elements  are  published,  they  can  be  typed  in 
as  shown.  This  file  is  ended  by  a  blank  line,  the  word  END  or  nothing, 
i.e.,  no  end-of-file  delimiter.  An  example  of  a  parameter  data  file 
would  be: 

Start  of  line)  (Put  at  least  2  spaces  before  and  after  parameter  name) 


Line 

1 

USS 

Si 

-34.08201495 

Line 

2 

UPP 

Si 

-28.03211675 

Line 

3 

BETAS 

Si 

-5.01104521 

Line 

4 

BETAP 

Si 

-2.23153969 

Line 

5 

ZS 

Si 

1.28184511 

Line 

6 

ZP 

Si 

1.84073175 

Line 

7 

ALP 

Si 

2.18688712 

Line 

8 

GSS 

Si 

9.82 

Line 

9 

GPP 

Si 

7.31 

Line 

10 

GSP 

Si 

8.36 

Line 

11 

GP2 

Si 

6.54 

Line 

12 

HSP 

Si 

1.32 

Derived  parameters  do  no  need  to  be  entered;  they  will  be  calculated 
from  the  optimized  parameters.  All  "constants"  such  as  the  experimental 
heat  of  atomization  are  already  inserted  for  all  elements. 

NOTE:  EXTERNAL  can  only  be  used  to  input  parameters  for  MNDO,  AMI, 
or  PM3 .  It  is  unlikely,  however,  that  any  more  MINDO/3  parameters  will 
be  published. 

See  also  DEP  to  make  a  permanent  change. 


FILL=n  (C) 

The  n'th  M.O.  in  an  RHF  calculation  is  constrained  to  be  filled. 
It  has  no  effect  on  a  UHF  calculation.  After  the  first  iteration  (NOTE: 
not  after  the  first  SCF  calculation,  but  after  the  first  iteration  within 
the  first  SCF  calculation)  the  n'th  M.O.  is  stored,  and,  if  occupied,  no 
further  action  is  taken  at  that  time.  If  unoccupied,  then  the  HOMO  and 
the  n'th  M.O.’s  are  swapped  around,  so  that  the  n'th  M.O.  is  now  filled. 
On  all  subsequent  iterations  the  M.O.  nearest  in  character  to  the  stored 
M.O.  is  forced  to  be  filled,  and  the  stored  M.O.  replaced  by  that  M.O. 
This  is  necessitated  by  the  fact  that  in  a  reaction  a  particular  M.O. 
may  change  its  character  considerably.  A  useful  procedure  is  to  run  1SCF 
and  DENOOT  first,  in  order  to  identify  the  M.O.'s;  the  complete  job  is 
then  run  with  OLDENS  and  FILL=nn,  so  that  the  eigenvectors  at  the  first 
iteration  are  fully  known.  As  FILL  is  known  to  give  difficulty  at  times, 
consider  also  using  C.I.=n  and  ROOT=m. 


FLEPO  (0) 

The  predicted  and  actual  changes  in  the  geometry,  the  derivatives, 
and  search  direction  for  each  geometry  optimization  cycle  are  printed. 
This  is  useful  if  there  is  any  question  regarding  the  efficiency  of  the 
geometry  optimizer. 
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FMAT 

Details  of  the  construction  of  the  Hessian  matrix  for  the  force 
calculation  are  to  be  printed. 


FORCE  ( C ) 

A  force-calculation  is  to  be  run.  The  Hessian,  that  is  the  matrix 
(in  millidynes  per  Angstrom)  of  second  derivatives  of  the  energy  with 
respect  to  displacements  of  all  pairs  of  atoms  in  x,  y,  and  z  directions, 
is  calculated.  On  diagonalization  this  gives  the  force  constants  for  the 
molecule.  The  force  matrix,  weighted  for  isotopic  masses,  is  then  used 
for  calculating  the  vibrational  frequencies.  The  system  can  be 

characterized  as  a  ground  state  or  a  transition  state  by  the  presence  of 
five  (for  a  linear  system)  or  six  eigenvalues  which  are  very  small  (less 
than  about  30  reciprocal  centimeters).  A  transition  state  is  further 
characterized  by  one,  and  exactly  one,  negative  force  constant. 

A  FORCE  calculation  is  a  prerequisite  for  a  THERMO  calculation. 

Before  a  FORCE  calculation  is  started,  a  check  is  made  to  ensure 

that  a  stationary  point  is  being  used.  This  check  involves  calculating 

the  gradient  norm  (GNORM)  and  if  it  is  significant,  the  GNORM  will  be 

|  reduced  using  BFGS.  All  internal  coordinates  are  optimized,  and  any 
symmetry  constraints  are  ignored  at  this  point.  An  implication  of  this 
is  that  if  the  specification  of  the  geometry  relies  on  any  angles  being 
exactly  180  or  zero  degrees,  the  calculation  may  fail. 

The  geometric  definition  supplied  to  FORCE  should  not  rely  on  angles 
or  dihedrals  assuming  exact  values.  (The  test  of  exact  linearity  is 
sufficiently  slack  that  most  molecules  that  are  linear,  such  as  acetylene 
and  but-2-yne,  should  not  be  stopped.)  See  also  THERMO.  LET,  TRANS, 
ISOTOPE. 

In  a  FORCE  calculation,  PRECISE  will  eliminate  quartic  contamination 
(part  of  the  anharmonicity ) .  This  is  normally  not  important,  therefore 
PRECISE  should  not  routinely  be  used.  In  a  FORCE  calculation,  the  SCF 
criterion  is  automatically  made  more  stringent;  this  is  the  main  cause  of 
the  SCF  failing  in  a  FORCE  calculation. 


GEO-OK  (W) 

Normally  the  program  will  stop  with  a  warning  message  if  two  atoms 
are  within  0.8  Angstroms  of  each  other,  or,  more  rarely,  the  BFGS  routine 
has  difficulty  optimizing  the  geometry.  GEO-OK  will  over-ride  the  job 
termination  sequence,  and  allow  the  calculation  to  proceed.  In  practice, 
most  jobs  that  terminate  due  to  these  checks  contain  errors  in  data,  so 
caution  should  be  exercised  if  GEO-OK  is  used.  An  important  exception  to 
this  warning  is  when  the  system  contains,  or  may  give  rise  to,  a  Hydrogen 
molecule.  GEO-OK  will  override  other  geometric  safety  checks  such  as  the 
unstable  gradient  in  a  geometry  optimization  preventing  reliable 
optimization. 
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See  also  the  message  "GRADIENTS  OF  OLD  GEOMETRY,  GNORM=  nn.nnnn" 


GNORM=n.nn  (W) 

The  geometry  optimization  termination  criteria  in  both  gradient 
inimization  and  energy  minimization  can  be  over-ridden  by  specifying  a 
radient  norm  requirement.  For  example,  GNORM=20  would  allow  the 
eometry  optimization  to  exit  as  soon  as  the  gradient  norm  dropped  below 
0.0,  the  default  being  1.0. 

For  high-precision  work,  GNORM=0.0  is  recommended.  Unless  LET  is 
Iso  used,  the  GNORM  will  be  set  to  the  larger  of  0.01  and  the  specified 
NORM.  Results  from  GNORM=0.01  are  easily  good  enough  for  all 
igh-precision  work. 


GRADIENTS  (0) 

In  a  1SCF  calculation  gradients  are  not  calculated  by  default:  in 
on-variationally  optimized  systems  this  would  take  an  excessive  time. 
RADI ENTS  allows  the  gradients  to  be  calculated.  Normally,  gradients 
ill  not  be  printed  if  the  gradient  norm  is  less  than  2.0.  However,  if 
RADI ENTS  is  present,  then  the  gradient  norm  and  the  gradients  will 
nconditionally  be  printed.  Abbreviation:  GRAD. 


GRAPH  (0) 

Information  needed  to  generate  electron  density  contour  maps  can  be 
ritten  to  a  file  by  calling  GRAPH.  GRAPH  first  calls  MULLIK  in  order  to 
enerate  the  inverse-square-root  of  the  overlap  matrix,  which  is  required 
or  the  re-normalization  of  the  eigenvectors.  All  data  essential  for  the 
raphics  package  DENSITY  are  then  output. 


HESS=n  (W) 

When  the  Eigenvector  Following  routine  is  used  for  geometry 
ptimization,  it  frequently  works  faster  if  the  Hessian  is  constructed 
irst.  If  HESS=1  is  specified,  the  Hessian  matrix  will  be  constructed 
efore  the  geometry  is  optimized.  There  are  other,  less  common,  options, 
.g.  HESS-2.  See  comments  in  subroutine  EF  for  details. 


H-PRIORITY  (0) 

In  a  DRC  calculation,  results  will  be  printed  whenever  the 
alculated  heat  of  formation  changes  by  0.1  Kcal/mole.  Abbreviation: 
-PRIO. 
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H-PRIORITY=n.nn  (0) 

In  a  DRC  calculation,  results  will  be  printed  whenever  the 
calculated  heat  of  formation  changes  by  n.nn  Kcal/mole. 


IRC  (C) 

An  Intrinsic  Reaction  Coordinate  calculation  is  to  be  run.  All 
kinetic  energy  is  shed  at  every  point  in  the  calculation.  See 
Background. 


IRC=n  (C) 

An  Intrinsic  Reaction  Coordinate  calculation  to  be  run;  an  initial 
perturbation  in  the  direction  of  normal  coordinate  n  to  be  applied.  If  n 
is  negative,  then  perturbation  is  reversed,  i.e.,  initial  motion  is  in 
the  opposite  direction  to  the  normal  coordinate.  This  keyword  cannot  be 
written  with  spaces  around  the  '='  sign. 


ISOTOPE  (0) 

Generation  of  the  FORCE  matrix  is  very  time-consuming,  and  in 
isotopic  substitution  studies  several  vibrational  calculations  may  be 
needed.  To  allow  the  frequencies  to  be  calculated  from  the  (constant) 
force  matrix,  ISOTOPE  is  used.  When  a  FORCE  calculation  is  completed, 
ISOTOPE  will  cause  the  force  matrix  to  be  stored,  regardless  of  whether 
or  not  any  intervening  restarts  have  been  made.  To  re-calculate  the 
frequencies,  etc.  starting  at  the  end  of  the  force  matrix  calculation, 
specify  RESTART. 

The  two  keywords  RESTART  and  ISOTOPE  can  be  used  together.  For 
ex.  nple,  if  a  normal  FORCE  calculation  runs  for  a  long  time,  the  user  may 
want  to  divide  it  up  into  stages  and  save  the  final  force  matrix.  Once 
ISOTOPE  has  been  used,  it  does  not  need  to  be  used  on  subsequent  RESTART 
runs. 

ISOTOPE  can  also  be  used  with  FORCE  to  set  up  a  RESTART  file  for  an 
IRC=n  calculation. 


ITRY=NN  (W) 

The  default  maximum  number  of  SCF  iterations  is  200.  When  this 
limit  presents  difficulty,  ITRY=nn  can  be  used  to  re-define  it.  For 
example,  if  ITRY=400  is  used,  the  maximum  number  of  iterations  will  be 
set  to  400.  ITRY  should  normally  not  be  changed  until  all  other  means  of 
obtaining  a  SCF  have  been  exhausted,  e.g.  PULAY  CAMP-KING  etc. 
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IUPD=n  (W) 

IUPD  is  used  only  in  the  EF  routine.  IUPD  should  very  rarely  be 
touched.  IUPD=1  can  be  used  in  minimum  searches  if  the  the  message 
"HEREDITARY  POSITIVE  DEFINITENESS  ENDANGERED.  UPDATE  SKIPPED  THIS  CYCLE" 
occurs  every  cycle  for  10-20  iterations.  Never  use  IUPD=2  for  a  TS 
search!  For  more  information,  read  the  comments  in  subroutine  EF. 


K=(n.nn,n)  (C) 

Used  in  band-structure  calculations,  K=(n.nn,n)  specifies  the 
step-size  in  the  Brillouin  zone,  and  the  number  of  atoms  in  the  monomeric 
unit.  Two  band-structure  calculations  are  supported:  electronic  and 

phonon.  Both  require  a  polymer  to  be  used.  If  FORCE  is  used,  a  phonon 
spectrum  is  assumed,  otherwise  an  electronic  band  structure  is  assumed. 
For  both  calculations,  a  density  of  states  is  also  done.  The  band 
structure  calculation  is  very  fast,  so  a  small  step-size  will  not  use 
much  time. 

The  output  is  designed  to  be  fed  into  a  graphics  package,  and  is  not 
'elegant'.  For  polyethylene,  a  suitable  keyword  would  be  K=( 0.01,6). 


KINETIC=n.nnn  (C) 

In  a  DRC  calculation  n.nnn  Kcals/mole  of  excess  kinetic  energy  is 
added  to  the  system  as  soon  as  the  kinetic  energy  builds  up  to  0.2 
Kcal/mole.  The  excess  energy  is  added  to  the  velocity  vector,  without 
change  of  direction. 


LARGE  (0) 


Most  of  the  time  the  output  invoked  by  keywords  is  sufficient. 
LARGE  will  cause  less-conunonly  wanted,  but  still  useful,  output  to  be 
printed. 

1.  To  save  space,  DRC  and  IRC  outputs  will,  by  default,  only  print  the 
line  with  the  percent  sign.  Other  output  can  be  obtained  by  use  of  the 
keyword  LARGE,  according  to  the  following  rules: 


Keyword 

LARGE 

LARGE=1 

LARGE=-1 

LARGE=n 

LARGE=-n 


Effect 

Print  all  internal  and  cartesian  coordinates 
and  cartesian  velocities. 

Print  all  internal  coordinates. 

Print  all  internal  and  cartesian  coordinates 
and  cartesian  velocities. 

Print  every  n'th  set  of  internal  coordinates. 
Print  every  n'th  set  of  internal  and  cartesian 
coordinates  and  cartesian  velocities. 


If  LARGE=1  is  used,  the  output  will  be  the  same  as  that  of  Version 
5.0,  when  LARGE  was  not  used.  If  LARGE  is  used,  the  output  will  be  the 
same  as  that  of  Version  5.0,  when  LARGE  was  used.  To  save  disk  space,  do 
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not  use  LARGE. 


LINMIN  (0) 

There  are  two  line-minimization  routines  in  MOPAC,  an  energy 
minimization  and  a  gradient  norm  minimization.  LINMIN  will  output 
details  of  the  line  minimization  used  in  a  given  job. 


LET  (W) 

As  MOPAC  evolves,  the  meaning  of  LET  is  changing. 

Now  LET  means  essentially  "I  know  what  I'm  doing,  override  safety 
checks" . 

Currently,  LET  has  the  following  meanings 

1.  In  a  FORCE  calculation,  it  means  that  the  supplied  geometry  is 
to  be  used,  even  if  the  gradients  are  large. 

2.  In  a  geometry  optimization,  the  specified  GNORM  is  to  be  used, 
even  if  it  is  less  than  0.01. 

3.  In  a  POLAR  calculation,  the  molecule  is  to  be  orientated  along 
its  principal  moments  of  inertia  before  the  calculation  starts. 
LET  will  prevent  this  step  being  done. 


LOCALIZE  (0) 

The  occupied  eigenvectors  are  transformed  into  a  localized  set  of 
M.O.'s  by  a  series  of  2  by  2  rotations  which  maximize  <psi**4>.  The 
value  of  l/<psi**4>  is  a  direct  measure  of  the  number  of  centers  involved 
in  the  M.O..  Thus  the  value  of  l/<psi**4>  is  2.0  for  H2,  3.0  for  a 
three-center  bond  and  1.0  for  a  lone  pair.  Higher  degeneracies  than 
allowed  by  point  group  theory  are  readily  obtained.  For  example,  benzene 
would  give  rise  to  a  6-fold  degenerate  C-H  bond,  a  6-fold  degenerate  C-C 
sigma  bond  and  a  three-fold  degenerate  C-C  pi  bond.  In  principle,  there 
is  no  single  step  method  to  unambiguously  obtain  the  most  localized  set 
of  M.O.'s  in  systems  where  several  canonical  structures  are  possible, 
just  as  no  simple  method  exists  for  finding  the  most  stable  conformer  of 
some  large  compound.  However,  the  localized  bonds  generated  will 
normally  be  quite  acceptable  for  routine  applications.  Abbreviation: 
LOCAL. 


MAX 

In  a  grid  calculation,  the  maximum  number  of  points  (23)  in  each 
direction  is  to  be  used.  The  default  is  11.  The  number  of  points  in 
each  direction  can  be  set  with  POINTS1  and  POINTS 2 . 
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MECI  (0) 

At  the  end  of  the  calculation  details  of  the  Multi  Electron 
Configuration  Interaction  calculation  are  printed  if  MECI  is  specified. 
The  state  vectors  can  be  printed  by  specifying  VECTORS.  The  MECI 
calculation  is  either  invoked  automatically,  or  explicitly  invoked  by  the 
use  of  the  C.l.=n  keyword. 


MICR0S=n  (C) 

The  microstates  used  by  MECI  are  normally  generated  by  use  of  a 
permutation  operator.  When  individually  defined  microstates  are  desired, 
then  MICROS-n  can  be  used,  where  n  defines  the  number  of  microstates  to 
be  read  in. 


Format  for  Microstates 

After  the  geometry  data  plus  any  symmetry  data  are  read  in,  data 
defining  each  microstate  is  read  in,  using  format  2011,  one  microstate 
per  line.  The  microstate  data  is  preceded  by  the  word  "MICROS"  on  a  line 
by  itself.  There  is  at  present  no  mechanism  for  using  MICROS  with  a 
reaction  path. 

For  a  system  with  n  M.O. 's  in  the  C.I.  (use  0PEN=(nl,n)  or  C.I.=n  to 
do  this),  the  populations  of  the  n  alpha  M.O. 's  are  defined,  followed  by 
the  n  beta  M.O.'s.  Allowed  occupancies  are  zero  and  one.  For  n=6  the 
closed-shell  ground  state  would  be  defined  as  111000111000,  meaning  one 
electron  in  each  of  the  first  three  alpha  M.O.'s,  and  one  electron  in 
each  of  the  first  three  beta  M.O.'s. 

Users  are  warned  that  they  are  responsible  for  completing  any  spin 
manifolds.  Thus  while  the  state  111100110000  is  a  triplet  state  with 
component  of  spin  =1,  the  state  111000110100,  while  having  a  component 
of  spin  =  0  is  neither  a  singlet  nor  a  triplet.  In  order  to  complete  the 
spin  manifold  the  microstate  110100111000  must  also  be  included. 

If  a  manifold  of  spin  states  is  not  complete,  then  the  eigenstates 
of  the  spin  operator  will  not  be  quantized.  When  and  only  when  100  or 
fewer  microstates  are  supplied,  can  spin  quantization  be  conserved. 

There  are  two  other  limitations  on  possible  microstates.  First,  the 
number  of  electrons  in  every  microstate  should  be  the  same.  If  they 
differ,  a  warning  message  will  be  printed,  and  the  calculation  continued 
(but  the  results  will  almost  certainly  be  nonsense).  Second,  the 
component  of  spin  for  every  microstate  must  be  the  same,  except  for 
teaching  purposes.  Two  microstates  of  different  components  of  spin  will 
have  a  zero  matrix  element  connecting  them.  No  warning  will  be  given  as 
this  is  a  reasonable  operation  in  a  teaching  situation.  For  example,  if 
all  states  arising  from  two  electrons  in  two  levels  are  to  be  calculated, 
say  for  teaching  Russel-Saunders  coupling,  then  the  following  microstates 
would  be  used: 
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crostate 

No.  of  alpha, 

beta  electrons 

Ms 

State 

1100 

2 

0 

1 

Triplet 

1010 

1 

1 

0 

Singlet 

1001 

1 

1 

0 

Mixed 

0110 

1 

1 

0 

Mixed 

0101 

1 

1 

0 

Singlet 

0011 

0 

2 

-1 

Triplet 

Constraints  on  the  space  manifold  are  just  as  rigorous,  but  much 
easier  to  satisfy.  If  the  energy  levels  are  degenerate,  then  all 
components  of  a  manifold  of  degenerate  M.O.'s  should  be  either  included 
or  excluded.  If  only  some,  but  not  all,  components  are  used,  the 
required  degeneracy  of  the  states  will  be  missing. 

As  an  example,  for  the  tetrahedral  methane  cation,  if  the  user 
supplies  the  microstates  corresponding  to  a  component  of  spin  =  3/2, 
neglecting  Jahn-Teller  distortion,  the  minimum  number  of  states  that  can 
be  supplied  is  90  =  ( 6 !/( 1 ! *5 ! ) ) * ( 6 !/C 4 1 *2 ! ) ) . 

While  the  total  number  of  electrons  should  be  the  same  for  all 
microstates,  this  number  does  not  need  to  be  the  same  as  the  number  of 
electrons  supplied  to  the  C.I.;  thus  in  the  example  above,  a  cationic 
state  could  be  110000111000. 

The  format  is  defined  as  2011  so  that  spaces  can  be  used  for  empty 
M.O.'s. 


MINDO/3  (C) 

The  default  Hamiltonian  within  MOPAC  is  MNDO,  with  the  alternatives 
of  AMI  and  MINDO/3.  To  use  the  MINDO/3  Hamiltonian  the  keyword  MINDO/3 
should  be  used.  Acceptable  alternatives  to  the  keyword  MINDO/3  are  MINDO 
and  MIND03 . 


MMOK  ( C) 

If  the  system  contains  a  peptide  linkage,  then  MMOK  will  allow  a 
molecular  mechanics  correction  to  be  applied  so  that  the  barrier  to 
rotation  is  increased  (to  14.00  Kcal/mole  in  N-methyl  acetamide). 


MODE  (C) 

MODE  is  used  in  the  EF  routine.  Normally  the  default  MODE=l  is  used 
to  locate  a  transition  state,  but  if  this  is  incorrect,  explicitly  define 
the  vector  to  be  followed  by  using  MODE=n.  (MODE  is  not  a  recommended 
keyword).  If  you  use  the  FORCE  option  when  deciding  which  mode  to 
follow,  set  all  isotopic  masses  to  1.0.  The  normal  modes  from  FORCE  are 
normally  mass-weighted;  this  can  mislead.  Alternatively,  use  LARGE  with 
FORCE:  this  gives  the  force  constants  and  vectors  in  addition  to  the 
mass-weighted  normal  modes.  Only  the  mass-weighted  modes  can  be  drawn 
with  DRAW. 
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MS=n 

Useful  for  checking  the  MECI  calculation  and  for  teaching.  MS=n 
overrides  the  normal  choice  of  magnetic  component  of  spin.  Normally,  if 
a  triplet  is  requested,  an  MS  of  1  will  be  used;  this  excludes  all 
singlets.  If  MS=0  is  also  given,  then  singlets  will  also  be  calculated. 
The  use  of  MS  should  not  affect  the  values  of  the  results  at  all. 


MULLIK  (0) 

A  full  Mulliken  Population  analysis  is  to  be  done  on  the  final  RHF 
wavef unction.  This  involves  the  following  steps: 

(1)  The  eigenvector  matrix  is  divided  by  the  square  root 
of  the  overlap  matrix,  S. 

(2)  The  Coulson-type  density  matrix,  P,  is  formed. 

(3)  The  overlap  population  is  formed  from  P(i, j)*S(i, j) . 

(4)  Half  the  off-diagonals  are  added  onto  the  diagonals. 


NLLSQ  (C) 

The  gradient  norm  is  to  be  minimized  by  Bartel's  method.  This  is  a 
Non-Linear  Least  Squares  gradient  minimization  routine.  Gradient 
minimization  will  locate  one  of  three  possible  points: 

(a)  A  minimum  in  the  energy  surface.  The  gradient  norm  will  go  to 
zero,  and  the  lowest  five  or  six  eigenvalues  resulting  from  a  FORCE 
calculation  will  be  approximately  zero. 

(b)  A  transition  state.  The  gradient  norm  will  vanish,  as  in  (a), 
but  in  this  case  the  system  is  characterized  by  one,  and  only  one, 
negative  force  constant. 

(c)  A  local  minimum  in  the  gradient  norm  space.  In  this  (normally 
unwanted)  case  the  gradient  norm  is  minimized,  but  does  not  go  to  zero. 
A  FORCE  calculation  will  not  give  the  five  or  six  zero  eigenvalues 
characteristic  of  a  stationary  point.  While  normally  undesirable,  this 
is  sometimes  the  only  way  to  obtain  a  geometry.  For  instance,  if  a 
system  is  formed  which  cannot  be  characterized  as  an  intermediate,  and  at 
the  same  time  is  not  a  transition  state,  but  nonetheless  has  some 
chemical  significance,  then  that  state  can  be  refined  using  NLLSQ. 


NOANCI  (W) 

RHF  open-shell  derivatives  are  normally  calculated  using  Liotard's 
analytical  C.I.  method.  If  this  method  is  NOT  to  be  used,  specify  NOANCI 
(NO  ANalytical  Configuration  Interaction  derivatives). 
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NODIIS  (W) 

In  the  event  that  the  G-DIIS  option  is  not  wanted/  NODIIS  can  be 
used.  The  G-DIIS  normally  accelerates  the  geometry  optimization/  but 
there  is  no  guarantee  that  it  will  do  so.  If  the  heat  of  formation  rises 
unexpectedly  (i.e.r  rises  during  a  geometry  optimization  while  the  GNORM 
is  larger  than  about  0.3),  then  try  NODIIS. 


NOINTER  (0) 

The  interatomic  distances  are  printed  by  default.  If  you  do  not 
want  them  to  be  printed,  specify  NOINTER.  For  big  jobs  this  reduces  the 
output  file  considerably. 


NOLOG  (0) 

Normally  a  copy  of  the  archive  file  will  be  directed  to  the  LOG 
file,  along  with  a  synopsis  of  the  job.  If  this  is  not  wanted,  it  can  be 
suppressed  completely  by  NOLOG. 


NOMM  (C) 

All  four  semi-empirical  methods  underestimate  the  barrier  to 
rotation  of  a  peptide  bond.  A  Molecular  Mechanics  correction  has  been 
added  which  increases  the  barrier  in  N-methyl  acetamide  to  14  Kcal/mole. 
If  you  do  not  want  this  correction,  specify  NOMM  (NO  Molecular 
Mechanics) . 


NONR  (W) 

Not  recommended  for  normal  use.  Used  with  the  EF  routine.  See 
source  code  for  more  details. 


NOTH I EL  (W) 

In  a  normal  geometry  optimization  using  the  BFGS  routine,  Thiel's 
FSTMIN  technique  is  used.  If  normal  line-searches  are  wanted,  specify 
NOTH I EL . 


NOXYZ  (0) 

The  cartesian  coordinates  are  printed  by  default.  If  you  do  not 
want  them  to  be  printed,  specify  NOXYZ.  For  big  jobs  this  reduces  the 
output  file  considerably. 
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NSURF  (C) 

In  an  ESP  calculation,  NSURF=n  specifies  the  number  of  surface 
layers  for  the  Connolly  surface. 


OLDENS  (W) 

A  density  matrix  produced  by  an  earlier  run  of  MOPAC  is  to  be  used 
to  start  the  current  calculation.  This  can  be  used  in  attempts  to  obtain 
an  SCF  when  a  previous  calculation  ended  successfully  but  a  subsequent 
run  failed  to  go  SCF. 


OLDGEO  (C) 

If  multiple  geometries  are  to  be  run,  and  the  final  geometry  from 
one  calculation  is  to  be  used  to  start  the  next  calculation,  OLDGEO 
should  be  specified.  Example:  If  a  MNDO,  AMI,  and  PM3  calculation  were 
to  be  done  on  one  system,  for  which  only  a  rough  geometry  was  available, 
then  after  the  MNDO  calculation,  the  AMI  calculation  could  be  done  using 
the  optimized  MNDO  geometry  as  the  starting  geometry,  by  specifying 
OLDGEO. 


OPEN(nl,n2)  (C) 

The  M.O.  occupancy  during  the  SCF  calculation  can  be  defined  in 
terms  of  doubly  occupied,  empty,  and  fractionally  occupied  M.O. 's.  The 
fractionally  occupied  M.O.'s  are  defined  by  OPEN(nl,n2),  where  nl  = 
number  of  electrons  in  the  open-shell  manifold,  and  n2  =  number  of 
open-shell  M.O.'s;  nl/n2  must  be  in  the  range  0  to  2.  OPEN(l,l)  will  be 
assumed  for  odd-electron  systems  unless  an  OPEN  keyword  is  used.  Errors 
introduced  by  use  of  fractional  occupancy  are  automatically  corrected  in 
a  MECI  calculation  when  0PEN(nl,n2)  is  used. 


OR IDE  (W) 

Do  not  use  this  keyword  until  you  have  read  Simons'  article.  ORIDE 
is  part  of  the  EF  routine,  and  means  "Use  whatever  Lamdas  are  produced 
even  if  they  would  normally  be  'unacceptable'." 

J.  Simons,  P.  Jorgensen,  H.  Taylor,  J.  Ozment,  J.  Phys.  Chem.  87, 
2745  (1983). 


PARASOK  (W) 

USE  THIS  KEYWORD  WITH  EXTREME  CAUTION! I  The  AMI  method  has  been 
parameterized  for  only  a  few  elements,  less  than  the  number  available  to 
MNDO  or  PM3.  If  any  elements  which  are  not  parameterized  at  the  AMI 
level  are  specified,  the  MNDO  parameters,  if  available,  will  be  used. 
The  resulting  mixture  of  methods,  AMI  with  MNDO,  has  not  been  studied  to 
see  how  good  the  results  are,  and  users  are  strictly  on  their  own  as  far 
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as  accuracy  and  compatibility  with  other  methods  is  concerned.  In 
particular,  while  all  parameter  sets  are  referenced  in  the  output,  other 
programs  may  not  cite  the  parameter  sets  used  and  thus  compatibility  with 
other  MNDO  programs  is  not  guaranteed. 


PI  (0) 

The  normal  density  matrix  is  composed  of  atomic  orbitals,  that  is  s, 
px,  py  and  pz.  PI  allows  the  user  to  see  how  each  atom-atom  interaction 
is  split  into  sigma  and  pi  bonds.  The  resulting  "density  matrix"  is 
composed  of  the  following  basis-functions:-  s-sigma,  p-sigma,  p-pi, 
d-sigma,  d-pi,  d-dell.  The  on-diagonal  terms  give  the  hybridization 
state,  so  that  an  sp2  hybridized  system  would  be  represented  as  s-sigma 
1.0,  p-sigma  2.0,  p-pi  1.0 


PM3  (C) 

The  PM3  method  is  to  be  used. 


POINT=n  (C) 

The  number  of  points  to  be  calculated  on  a  reaction  path  is 
specified  by  POINT=n.  Used  only  with  STEP  in  a  path  calculation. 


POINTl=n  (C) 

In  a  grid  calculation,  the  number  of  points  to  be  calculated  in  the 
first  direction  is  given  by  P0INTl=n.  'n*  should  be  less  than  24; 
default:  11. 


POINT2=n  (C) 

In  a  grid  calculation,  the  number  of  points  to  be  calculated  in  the 
second  direction  is  given  by  P0INT2=n.  'n'  should  be  less  than  24, 
default:  11; 


POTWRT  (W) 

In  an  ESP  calculation,  write  out  surface  points  and  electrostatic 
potential  values  to  UNIT  21. 


POLAR  ( C) 

The  polarizability  and  first  and  second  hyperpolarizabilities  are  to 
be  calculated.  At  present  this  calculation  does  not  work  for  polymers, 
but  should  work  for  all  other  systems. 
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By  default,  an  electric  field  gradient  of  0.001  is  used.  This  can 
be  modified  by  specifying  POLAR=n . nnnnn ,  where  n.nnnnn  is  the  new  field. 

POLAR  calculates  the  polarizabilities  from  the  heat  of  formation  and 
from  the  dipole.  The  degree  to  which  they  agree  is  a  measure  of  the 
precision  (not  the  accuracy)  of  the  calculation.  The  results  from  the 
heat  of  formation  calculation  are  more  trustworthy  than  those  from  the 
dipole. 

Users  should  note  that  the  hyperpolarizabilities  obtained  have  to  be 
divided  by  2.0  for  beta  and  6.0  for  gamma  to  conform  with  experimental 
convention. 

Two  sets  of  results  are  printed:  a  set  (labeled  E4)  derived  from 
the  effect  of  the  applied  electric  field  on  the  heat  of  formation,  and  a 
set  (labeled  DIP)  derived  from  the  value  of  the  dipole  in  various 
electric  fields. 


POWSQ  (C) 

Details  of  the  working  of  POWSQ  are  printed  out.  This  is  only 
useful  in  debugging. 


PRECISE  (W) 

The  criteria  for  terminating  all  optimizations,  electronic  and 
geometric,  are  to  be  increased  by  a  factor,  normally,  100.  This  can  be 
used  where  more  precise  results  are  wanted.  If  the  results  are  going  to 
be  used  in  a  FORCE  calculation,  where  the  geometry  needs  to  be  known 
quite  precisely,  then  PRECISE  is  recommended;  for  small  systems  the  extra 
cost  in  CPU  time  is  minimal.  PRECISE  is  not  recommended  for  experienced 
users,  instead  GNORM=n.nn  and  SCFCRT=n.nn  are  suggested.  PRECISE  should 
only  very  rarely  be  necessary  in  a  FORCE  calculation:  all  it  does  is 
remove  quartic  contamination,  which  only  affects  the  trivial  modes 
significantly,  and  is  very  expensive  in  CPU  time. 


PULAY  (W) 

The  default  converger  in  the  SCF  calculation  is  to  be  replaced  by 
Pulay's  procedure  as  soon  as  the  density  matrix  is  sufficiently  stable. 
A  considerable  improvement  in  speed  can  be  achieved  by  the  use  of  PULAY. 
If  a  large  number  of  SCF  calculations  are  envisaged,  a  sample  calculation 
using  1SCF  and  PULAY  should  be  compared  with  using  1SCF  on  its  own,  and 
if  a  saving  in  time  results,  then  PULAY  should  be  used  in  the  full 
calculation.  PULAY  should  be  used  with  care  in  that  its  use  will  prevent 
the  combined  package  of  convergers  (SHIFT,  PULAY  and  the  CAMP-KING 
convergers)  from  automatically  being  used  in  the  event  that  the  system 
fails  to  go  SCF  in  (ITRY-10)  iterations. 

The  combined  set  of  convergers  very  seldom  fails. 
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QUARTET  (C) 

RHP  interpretation:  The  desired  spin-state  is  a  quartet,  i.e.,  the 
state  with  component  of  spin  =  1/2  and  spin  =  3/2.  When  a  configuration 
interaction  calculation  is  done,  all  spin  states  of  spin  equal  to,  or 
greater  than  1/2  are  calculated  simultaneously,  for  component  of  spin  = 
1/2.  From  these  states  the  quartet  states  are  selected  when  QUARTET  is 
specified,  and  all  other  spin  states,  while  calculated,  are  ignored  in 
the  choice  of  root  to  be  used.  If  QUARTET  is  used  on  its  own,  then  a 
single  state,  corresponding  to  an  alpha  electron  in  each  of  three  M.O. 's 
is  calculated. 

UHF  interpretation:  The  system  will  have  three  more  alpha  electrons 
than  beta  electrons. 


QUINTET  (C) 

RHF  interpretation:  The  desired  spin-state  is  a  quintet,  that  is, 
the  state  with  component  of  spin  =  0  and  spin  =  2.  When  a  configuration 
interaction  calculation  is  done,  all  spin  states  of  spin  equal  to,  or 
greater  than  0  are  calculated  simultaneously,  for  component  of  spin  =  0. 
From  these  states  the  quintet  states  are  selected  when  QUINTET  is 
specified,  and  the  septet  states,  while  calculated,  will  be  ignored  in 
the  choice  of  root  to  be  used.  If  QUINTET  is  used  on  its  own,  then  a 
single  state,  corresponding  to  an  alpha  electron  in  each  of  four  M.O.'s 
is  calculated. 

UHF  interpretation:  The  system  will  have  three  more  alpha  electrons 
than  beta  electrons. 


RECALC=n 

RECALC =n  calculates  the  Hessian  every  n  steps  in  the  EF 
optimization.  For  small  n  this  is  costly  but  is  also  very  effective  in 
terms  of  convergence.  RECALC=10  and  DMAX=0.10  can  be  useful  for 
difficult  cases.  In  extreme  cases  RECALC=1  and  DMAX=0.05  will  always 
find  a  stationary  point,  if  it  exists. 


RESTART  (W) 

When  a  job  has  been  stopped,  for  whatever  reason,  and  intermediate 
results  have  been  stored,  then  the  calculation  can  be  restarted  at  the 
point  where  it  stopped  by  specifying  RESTART.  The  most  common  cause  of  a 
job  stopping  before  completion  is  its  exceeding  the  time  allocated.  A 
saddle-point  calculation  has  no  restart,  but  the  output  file  contains 
information  which  can  easily  be  used  to  start  the  calculation  from  a 
point  near  to  where  it  stopped. 

It  is  not  necessary  to  change  the  geometric  data  to  reflect  the  new 
geometry.  As  a  result,  the  geometry  printed  at  the  start  of  a  restarted 
job  will  be  that  of  the  original  data,  not  that  of  the  restarted  file. 
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A  convenient  way  to  monitor  a  long  run  is  to  specify  1SCF  and 
RESTART;  this  will  give  a  normal  output  file  at  very  little  cost. 

NOTE  1:  In  the  FORCE  calculation  two  restarts  are  possible.  These 
are  (a)  a  restart  in  FLEPO  if  the  geometry  was  not  optimized  fully  before 
FORCE  was  called,  and  (b)  the  normal  restart  in  the  construction  of  the 
force  matrix.  If  the  restart  is  in  FLEPO  within  FORCE  then  the  keyword 
FORCE  should  be  deleted,  and  the  keyword  RESTART  used  on  its  own. 
Forgetting  this  point  is  a  frequent  cause  of  failed  jobs. 

NOTE  2:  Two  restarts  also  exist  in  the  IRC  calculation.  If  an  IRC 
calculation  stops  while  in  the  FORCE  calculation,  then  a  normal  restart 
can  be  done.  If  the  job  stops  while  doing  the  IRC  calculation  itself 
then  the  keyword  IRC=n  should  be  changed  to  IRC,  or  it  can  be  omitted  if 
DRC  is  also  specified.  The  absence  of  the  string  "IRC-"  is  used  to 
indicate  that  the  FORCE  calculation  was  completed  before  the  restart 
files  were  written. 


ROOT=n  (C) 

The  n'th  root  of  a  C.I.  calculation  is  to  be  used  in  the 
calculation.  If  a  keyword  specifying  the  spin-state  is  also  present, 
e.g.  SINGLET  or  TRIPLET,  then  the  n'th  root  of  that  state  will  be 
selected.  Thus  ROOT=3  and  SINGLET  will  select  the  third  singlet  root. 
If  ROOT=3  is  used  on  its  own,  then  the  third  root  will  be  used,  which  may 
be  a  triplet,  the  third  singlet,  or  the  second  singlet  (the  second  root 
might  be  a  triplet).  In  normal  use,  this  keyword  would  not  be  used.  It 
is  retained  for  educational  and  research  purposes.  Unusual  care  should 
be  exercised  when  ROOT=  is  specified. 


ROT=n  (C) 

In  the  calculation  of  the  rotational  contributions  to  the 
thermodynamic  quantities  the  symmetry  number  of  the  molecule  must  be 
supplied.  The  symmetry  number  of  a  point  group  is  the  number  of 
equivalent  positions  attainable  by  pure  rotations.  No  reflections  or 
improper  rotations  are  allowed.  This  number  cannot  be  assumed  by 
default,  and  may  be  affected  by  subtle  modifications  to  the  molecule, 
such  as  isotopic  substitution.  A  list  of  the  most  important  symmetry 
numbers  follows: 


TABLE 

OF  SYMMETRY 

NUMBERS 

Cl 

Cl  cs 

1 

D2  D2D  D2H 

4 

C(  INF)  V 

1 

C2 

C2V  C2H 

2 

D3  D3D  D3H 

6 

D(INF)H 

2 

C3 

C3V  C3H 

3 

D4  D40  D4B 

8 

T  TD 

12 

C4 

C4V  C4H 

4 

D6  D6D  D6H 

12 

OB 

24 

C6 

C6V  C6H 

6 

S6 

3 
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SADDLE  (C) 

The  transition  state  in  a  simple  chemical  reaction  is  to  be 
optimized.  Extra  data  are  required.  After  the  first  geometry, 
specifying  the  reactants,  and  any  symmetry  functions  have  been  defined, 
the  second  geometry,  specifying  the  products,  is  defined,  using  the  same 
format  as  that  of  the  first  geometry. 

SADDLE  often  fails  to  work  successfully.  Frequently  this  is  due  to 
equivalent  dihedral  angles  in  the  reactant  and  product  differing  by  about 
360  degrees  rather  than  zero  degrees.  As  the  choice  of  dihedral  can  be 
difficult,  users  should  consider  running  this  calculation  with  the 
keyword  XYZ.  There  is  normally  no  ambiguity  in  the  definition  of 
cartesian  coordinates.  See  also  BAR= . 

Many  of  the  bugs  in  SADDLE  have  been  removed  in  this  version.  Use 
of  the  XYZ  option  is  strongly  recommended. 


SCALE  (C) 

SCALE=n.n  specifies  the  scaling  factor  for  Van  der  Waals'  radii  for 
the  initial  layer  of  the  Connolly  surface  in  the  ESP  calculation. 


SCFCRT=n. nn  (W) 

The  default  SCF  criterion  is  to  be  replaced  by  that  defined  by 
SCFCRT= . 

The  SCF  criterion  is  the  change  in  energy  in  kcal/mol  on  two 
successive  iterations.  Other  minor  criteria  may  make  the  requirements 
for  an  SCF  slightly  more  stringent.  The  SCF  criterion  can  be  varied  from 
about  0.001  to  l.D-25,  although  numbers  in  the  range  0.0001  to  l.D-9  will 
suffice  for  most  applications. 

An  overly  tight  criterion  can  lead  to  failure  to  achieve  a  SCF,  and 
consequent  failure  of  the  run. 


SCINCR=n. nn 

In  an  ESP  calculation,  SCINCR=n.nn  specifies  the  increment  between 
layers  of  the  surface  in  the  Connolly  surface,  (default:  0.20) 


SETUP  (C) 

If,  on  the  keyword  line,  the  word  'SETUP'  is  specified,  then  one  or 
two  lines  of  keywords  will  be  read  from  a  file  with  the  logical  name 
SETUP.  The  logical  file  SETUP  must  exist,  and  must  contain  at  least  one 
line.  If  the  second  line  is  defined  by  the  first  line  as  a  keyword  line, 
and  the  second  line  contains  the  word  SETUP,  then  one  line  of  keywords 
will  be  read  from  a  file  with  the  logical  name  SETUP. 
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SETUP=name  (C) 

Same  as  SETUP,  only  the  logical  or  actual  name  of  the  SETUP  file  is 
'  name  * . 


SEXTET  (C) 

RHF  interpretation:  The  desired  spin-state  is  a  sextet:  the  state 
with  component  of  spin  =  1/2  and  spin  =  5/2. 

The  sextet  states  are  the  highest  spin  states  normally  calculable 
using  MOPAC  in  its  unmodified  form.  If  SEXTET  is  used  on  its  own,  then  a 
single  state,  corresponding  to  one  alpha  electron  in  each  of  five  M.O.'s, 
is  calculated.  If  several  sextets  are  to  be  calculated,  say  the  second 
or  third,  then  OPEN(nl,n2)  should  be  used. 

UHP  interpretation:  The  system  will  have  five  more  alpha  electrons 
than  beta  electrons. 


SHIFT=n.nn  (W) 

In  an  attempt  to  obtain  an  SCF  by  damping  oscillations  which  slow 
down  the  convergence  or  prevent  an  SCF  being  achieved,  the  virtual  M.O. 
energy  levels  are  shifted  up  or  down  in  energy  by  a  shift  technique.  The 
principle  is  that  if  the  virtual  M.O.'s  are  changed  in  energy  relative  to 
the  occupied  set,  then  the  polarizability  of  the  occupied  M.O.'s  will 
change  pro  rata.  Normally,  oscillations  are  due  to  autoregenerative 
charge  fluctuations. 

The  SHIFT  method  has  been  re-written  so  that  the  value  of  SHIFT 
changes  automatically  to  give  a  critically-damped  system.  This  can 
result  in  a  positive  or  negative  shift  of  the  virtual  M.O.  energy 
levels.  If  a  non-zero  SHIFT  is  specified,  it  will  be  used  to  start  the 
SHIFT  technique,  rather  than  the  default  15eV.  If  SHIFT=0  is  specified, 
the  SHIFT  technique  will  not  be  used  unless  normal  convergence  techniques 
fail  and  the  automatic  "ALL  CONVERGERS..."  message  is  produced. 


SIGMA  (C) 

The  Mclver-Komornicki  gradient  norm  minimization  routines,  POWSQ  and 
SEARCH  are  to  be  used.  These  are  very  rapid  routines,  but  do  not  work 
for  all  species.  If  the  gradient  norm  is  low,  i.e.,  less  than  about  5 
units,  then  SIGMA  will  probably  work;  in  most  cases,  NLLSQ  is 
recommended.  SIGMA  first  calculates  a  quite  accurate  Hessian  matrix,  a 
slow  step,  then  works  out  the  direction  of  fastest  decent,  and  searches 
along  that  direction  until  the  gradient  norm  is  minimized.  The  Hessian 
is  then  partially  updated  in  light  of  the  new  gradients,  and  a  fresh 
search  direction  found.  Clearly,  if  the  Hessian  changes  markedly  as  a 
result  of  the  line-search,  the  update  done  will  be  inaccurate,  and  the 
new  search  direction  will  be  faulty. 
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SIGMA  should  be  avoided  if  at  all  possible  when  non-variationally 
optimized  calculations  are  being  done. 

If  the  Hessian  is  suspected  to  be  corrupt  within  SIGMA  it  will  be 
automatically  recalculated.  This  frequently  speeds  up  the  rate  at  which 
the  transition  state  is  located.  If  you  do  not  want  the  Hessian  to  be 
reinitialized  —  it  is  costly  in  CPU  time  —  specify  LET  on  the  keyword 
line. 


SINGLET  (C) 

When  a  configuration  interaction  calculation  is  done,  all  spin 
states  are  calculated  simultaneously,  either  for  component  of  spin  -  0  or 
1/2.  When  only  singlet  states  are  of  interest,  then  SINGLET  can  be 
specified,  and  all  other  spin  states,  while  calculated,  are  ignored  in 
the  choice  of  root  to  be  used. 

Note  that  while  almost  every  even-electron  system  will  have  a 
singlet  ground  state,  SINGLET  should  still  be  specified  if  the  desired 
state  must  be  a  singlet. 

SINGLET  has  no  meaning  in  a  UHF  calculation,  but  see  also  TRIPLET. 


SLOPE  (C) 

In  an  ESP  calculation,  SLOPE=n.nn  specifies  the  scale  factor  for 
MNDO  charges.  (default=1.422) 


SPIN  (0) 

The  spin  matrix,  defined  as  the  difference  between  the  alpha  and 
beta  density  matrices,  is  to  be  printed.  If  the  system  has  a 
closed-shell  ground  state,  e.g.  methane  run  UHF,  the  spin  matrix  will  be 
null . 

If  SPIN  is  not  requested  in  a  UHF  calculation,  then  the  diagonal  of 
the  spin  matrix,  that  is  the  spin  density  on  the  atomic  orbitals,  will  be 
printed. 


STEP  (C) 

In  a  reaction  path,  if  the  path  step  is  constant,  STEP  can  be  used 
instead  of  explicitly  specifying  each  point.  The  number  of  steps  is 
given  by  POINT.  If  the  reaction  coordinate  is  an  interatomic  distance, 
only  positive  STEPs  are  allowed. 
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STEPl=n.nnn  (C) 

In  a  grid  calculation  the  step  size  in  degrees  or  Angstroms  for  the 
first  of  the  two  parameters  is  given  by  n.nnn.  By  default,  an  11  by  11 
grid  is  generated.  See  P0INT1  and  P0INT2  on  how  to  adjust  this  number. 
The  first  point  calculated  is  the  supplied  geometry,  and  is  in  the  upper 
left  hand  corner.  This  is  a  change  from  Version  5.00,  where  the  supplied 
geometry  was  the  central  point. 


STEP2=n.nnn  (C) 

In  a  grid  calculation  the  step  size  in  degrees  or  Angstroms  for  the 
second  of  the  two  parameters  is  given  by  n.nnn. 


ST03G  (W) 

In  an  ESP  calculation  ST03G  means  "Use  the  STO-3G  basis  set  to 
de-orthogonalize  the  semiempir ical  orbitals". 


SYMAVG  (W) 

Used  by  the  ESP,  SYMAVG  will  average  charges  which  should  have  the 
same  value  by  symmetry. 


SYMMETRY  (C) 

Symmetry  data  defining  related  bond  lengths,  angles  and  dihedrals 
can  be  included  by  supplying  additional  data  after  the  geometry  has  been 
entered.  If  there  are  any  other  data,  such  as  values  for  the  reaction 
coordinates,  or  a  second  geometry,  as  required  by  SADDLE,  then  it  would 
follow  the  symmetry  data.  Symmetry  data  are  terminated  by  one  blank 
line.  For  non-variationally  optimized  systems  symmetry  constraints  can 
save  a  lot  of  time  because  many  derivatives  do  not  need  to  be  calculated. 
At  the  same  time,  there  is  a  risk  that  the  geometry  may  be  wrongly 
specified,  e.g.  if  methane  radical  cation  is  defined  as  being 
tetrahedral,  no  indication  that  this  is  faulty  will  be  given  until  a 
FORCE  calculation  is  run.  (This  system  undergoes  spontaneous  Jahn-Teller 
distortion.) 

Usually  a  lower  heat  of  formation  can  be  obtained  when  SYMMETRY  is 
specified.  To  see  why,  consider  the  geometry  of  benzene.  if  no 
assumptions  are  made  regarding  the  geometry,  then  all  the  C-C  bond 
lengths  will  be  very  slightly  different,  and  the  angles  will  be  almost, 
but  not  quite  120  degrees.  Fixing  all  angles  at  120  degrees,  dihedrals 
at  180  or  0  degrees,  and  only  optimizing  one  C-C  and  one  C-H  bond-length 
will  result  in  a  2-D  optimization,  and  exact  D6h  symmetry.  Any 
deformation  from  this  symmetry  must  involve  error,  so  by  imposing 
symmetry  some  error  is  removed. 
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The  layout  of  the  symmetry  data  is: 

<defining  atom>  <symmetry  relation>  <defined  atom>  <defined  atom>,... 
where  the  numerical  code  for  <symmetry  relation?  is  given  in  the  table  of 
SYMMETRY  FUNCTIONS  below. 

For  example,  ethane,  with  three  independent  variables,  can  be 
defined  as 

SYMMETRY 


ETHANE,  D3D 

NA 

NB 

NC 

C 

C 

1.528853  1 

1 

H 

1.105161  1 

110.240079 

1 

2 

1 

H 

1.105161  0 

110.240079 

0 

120.000000 

0 

2 

1 

3 

H 

1.105161  0 

110.240079 

0 

240.000000 

0 

2 

1 

3 

H 

1.105161  0 

110.240079 

0 

60.000000 

0 

1 

2 

3 

H 

1.105161  0 

110.240079 

0 

180.000000 

0 

1 

2 

3 

H 

1.105161  0 

110.240079 

0 

300.000000 

0 

1 

2 

3 

0 

0.000000  0 

0.000000 

0 

0.000000 

0 

0 

0 

0 

3 

,  1,  4, 

5,  6 

9 

7,  8, 

3 

,  2,  4, 

5,  6 

9 

7,  8, 

Here  atom  3, 

a  hydrogen 

9 

is  used  to 

define 

the  bond  lengths 

(symmetry  relation  1)  of  atoms  4, 5, 6, 7  and  8  with  the  atoms  they  are 
specified  to  bond  with  in  the  NA  column  of  the  data  file;  similarly,  its 
angle  (symmetry  relation  2)  is  used  to  define  the  bond-angle  of  atoms 
4, 5, 6, 7  and  8  with  the  two  atoms  specified  in  the  NA  and  NB  columns  of 
the  data  file.  The  other  angles  are  point-group  symmetry  defined  as  a 
multiple  of  60  degrees. 

Spaces,  tabs  or  commas  can  be  used  to  separate  data.  Note  that  only 
three  parameters  are  marked  to  be  optimized.  The  symmetry  data  can  be 
the  last  line  of  the  data  file  unless  more  data  follows,  in  which  case  a 
blank  line  must  be  inserted  after  the  symmetry  data. 
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The  £ull  list  of  available  symmetry  relations  is  as  follows: 


SYMMETRY  FUNCTIONS 


<Symmetry 

relation> 


1  BOND  LENGTH  IS  SET  EQUAL  TO  THE  REFERENCE  BOND  LENGTH 

2  BOND  ANGLE  IS  SET  EQUAL  TO  THE  REFERENCE  BOND  ANGLE 


3 

DIHEDRAL 

ANGLE 

IS  SET 

EQUAL  TO  THE  REFERENCE  DIHEDRAL  ANGLE 

4 

DIHEDRAL 

ANGLE 

VARIES 

AS 

90 

DEGREES 

- 

REFERENCE 

DIHEDRAL 

5 

DIHEDRAL 

ANGLE 

VARIES 

AS 

90 

DEGREES 

+ 

REFERENCE 

DIHEDRAL 

6 

DIHEDRAL 

ANGLE 

VARIES 

AS 

120 

DEGREES 

- 

REFERENCE 

DIHEDRAL 

7 

DIHEDRAL 

ANGLE 

VARIES 

AS 

120 

DEGREES 

+ 

REFERENCE 

DIHEDRAL 

8 

DIHEDRAL 

ANGLE 

VARIES 

AS 

180 

DEGREES 

- 

REFERENCE 

DIHEDRAL 

9 

DIHEDRAL 

ANGLE 

VARIES 

AS 

180 

DEGREES 

+ 

REFERENCE 

DIHEDRAL 

10 

DIHEDRAL 

ANGLE 

VARIES 

AS 

240 

DEGREES 

- 

REFERENCE 

DIHEDRAL 

11 

DIHEDRAL 

ANGLE 

VARIES 

AS 

240 

DEGREES 

+ 

REFERENCE 

DIHEDRAL 

12 

DIHEDRAL 

ANGLE 

VARIES 

AS 

270 

DEGREES 

- 

REFERENCE 

DIHEDRAL 

13 

DIHEDRAL 

ANGLE 

VARIES 

AS 

270 

DEGREES 

+ 

REFERENCE 

DIHEDRAL 

14 

DIHEDRAL 

ANGLE 

VARIES 

AS 

THE 

NEGATIVE  OF  THE  REFERENCE 

DIHEDRAL 

15  BOND  LENGTH  VARIES  AS  HALF  THE  REFERENCE  BOND  LENGTH 

16  BOND  ANGLE  VARIES  AS  HALF  THE  REFERENCE  BOND  ANGLE 

17  BOND  ANGLE  VARIES  AS  180  DEGREES  -  REFERENCE  BOND  ANGLE 

18  BOND  LENGTH  IS  A  MULTIPLE  OF  REFERENCE  BOND-LENGTH 

Function  18  is  intended  for  use  in  polymers,  in  which  the 
translation  vector  may  be  a  multiple  of  some  bond-length.  1,2,3  and  14 
are  most  commonly  used.  Abbreviation:  SYM. 

SYMMETRY  is  not  available  for  use  with  cartesian  coordinates. 


T=  (W) 

This  is  a  facility  to  allow  the  program  to  shut  down  in  an  orderly 
manner  on  computers  with  execution  time  C.P.U.  limits. 

The  total  C.P.U.  time  allowed  for  the  current  job  is  limited  to 
nn.nn  seconds;  by  default  this  is  one  hour,  i.e.,  3600  seconds.  If  the 
next  cycle  of  the  calculation  cannot  be  completed  without  running  a  risk 
of  exceeding  the  assigned  time  the  calculation  will  write  a  restart  file 
and  then  stop.  The  safety  margin  is  100  percent;  that  is,  to  do  another 
cycle,  enough  time  to  do  at  least  two  full  cycles  must  remain. 

Alternative  specifications  of  the  time  are  T=nn.nnM,  this  defines 
the  time  in  minutes,  T=nn.nnH,  in  hours,  and  T=nn.nnD,  in  days,  for  very 
long  jobs.  This  keyword  cannot  be  written  with  spaces  around  the  '  =  ' 
sign. 
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THERMO  (0) 

The  thermodynamic  quantities,  internal  energy,  heat  capacity, 
partition  function,  and  entropy  can  be  calculated  for  translation, 
rotation  and  vibrational  degrees  of  freedom  for  a  single  temperature,  or 
a  range  of  temperatures.  Special  situations  such  as  linear  systems  and 
transition  states  are  accommodated.  The  approximations  used  in  the 
THERMO  calculation  are  invalid  below  100K,  and  checking  of  the  lower 
bound  of  the  temperature  range  is  done  to  prevent  temperatures  of  less 
than  100K  being  used. 

Another  limitation,  for  which  no  checking  is  done,  is  that  there 
should  be  no  internal  rotations.  If  any  exist,  they  will  not  be 
recognized  as  such,  and  the  calculated  quantities  will  be  too  low  as  a 
result. 

In  order  to  use  THERMO  the  keyword  FORCE  must  also  be  specified,  as 
well  as  the  value  for  the  symmetry  number;  this  is  given  by  ROT-n. 

If  THERMO  is  specified  on  its  own,  then  the  default  values  of  the 
temperature  range  are  assumed.  This  starts  at  200K  and  increases  in 
steps  of  10  degrees  to  400K.  Three  options  exist  for  overriding  the 
default  temperature  range.  These  are: 


thermo (nnn)  (0) 

The  thermodynamic  quantities  for  a  200  degree  range  of  temperatures, 
starting  at  nnnK  and  with  an  interval  of  10  degrees  are  to  be  calculated. 


THERMO (nnn, mmm)  (0) 

The  thermodynamic  quantities  for  the  temperature  range  limited  by  a 
lower  bound  of  nnn  Kelvin  and  an  upper  bound  of  mmm  Kelvin,  the  step  size 
being  calculated  in  order  to  give  ap  ^oximately  20  points,  and  a 
reasonable  value  for  the  step.  The  s.  ®  of  the  step  in  Kelvin  degrees 
will  be  1,  2,  or  5,  or  a  power  of  10  times  these  numbers. 


THERMO (nnn, mmm, 111)  (0) 

Same  as  for  THERMO (nnn, mmm) ,  only  now  the  user  can  explicitly  define 
the  step  size.  The  step  size  cannot  be  less  than  IK. 


T-PRIORITY  (O) 

In  a  DRC  calculation,  results  will  be  printed  whenever  the 
calculated  time  changes  by  0.1  femtoseconds.  Abbreviation,  T-PRIO. 
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T-PRIORITY=n.nn  (0) 

In  a  DRC  calculation,  results  will  be  printed  whenever  the 
calculated  time  changes  by  n.nn  femtoseconds. 


TRANS  (C) 

The  imaginary  frequency  due  to  the  reaction  vector  in  a  transition 
state  calculation  must  not  be  included  in  the  thermochemical  calculation. 
The  number  of  genuine  vibrations  considered  can  be: 

3N-5  for  a  linear  ground  state  system, 

3N-6  for  a  non-linear  ground  state  system,  or 

3N-6  for  a  linear  transition-state  complex, 

3N-7  for  a  non-linear  transition-state  complex. 

This  keyword  must  be  used  in  conjunction  with  THERMO  if  a  transition 
state  is  being  calculated. 


TRANS=n  (C) 

The  facility  exists  to  allow  the  THERMO  calculation  to  handle 
systems  with  internal  rotations.  TRANS=n  will  remove  the  n  lowest 
vibrations.  Note  that  TRANS=1  is  equivalent  to  TRANS  on  its  own.  For 
xylene,  for  example,  TRANS=2  would  be  suitable. 

This  keyword  cannot  be  written  with  spaces  around  the  '='  sign. 


TRIPLET  (C) 

The  triplet  state  is  defined.  If  the  system  has  an  odd  number  of 
electrons,  an  error  message  will  be  printed. 

UHF  interpretation. 

The  number  of  alpha  electrons  exceeds  that  of  the  beta  electrons  by 
2.  If  TRIPLET  is  not  specified,  then  the  numbers  of  alpha  and  beta 
electrons  are  set  equal.  This  does  not  necessarily  correspond  to  a 
singlet . 

RHF  interpretation. 

An  RHF  MECI  calculation  is  performed  to  calculate  the  triplet  state. 
If  no  other  C.I.  keywords  are  used,  then  only  one  state  is  calculated  by 
default.  The  occupancy  of  the  M.O.'s  in  the  SCF  calculation  is  defined 
as  ( . . .2, 1,1,0, . . ) ,  that  is,  one  electron  is  put  in  each  of  the  two 
highest  occupied  M.O.'s. 
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See  keywords  C.I.=n  and  0PEN(nl,n2). 


TS  (C) 

Within  the  Eigenvector  Following  routine,  the  option  exists  to 
optimize  a  transition  state.  To  do  this,  use  TS.  Preliminary 
indications  are  that  the  TS  method  is  much  faster  and  more  reliable  than 
either  SIGMA  or  NLLSQ. 

TS  appears  to  work  well  with  cartesian  coordinates. 

In  the  event  that  TS  does  not  converge  on  a  stationary  point,  try 
adding  RECALC=5  to  the  keyword  line. 


UHF  (C) 

The  unrestricted  Hartree-Fock  Hamiltonian  is  to  be  used. 


VECTORS  (0) 

The  eigenvectors  are  to  be  printed.  In  UHF  calculations  both  alpha 
and  beta  eigenvectors  are  printed;  in  all  cases  the  full  set,  occupied 
and  virtual,  are  output.  The  eigenvectors  are  normalized  to  unity,  that 
is  the  sum  of  the  squares  of  the  coefficients  is  exactly  one.  If  DEBUG 
is  specified,  then  ALL  eigenvectors  on  every  iteration  of  every  SCF 
calculation  will  be  printed.  This  is  useful  in  a  learning  context,  but 
would  normally  be  very  undesirable. 


VELOCITY  (C) 

The  user  can  supply  the  initial  velocity  vector  to  start  a  DRC 
calculation.  Limitations  have  to  be  imposed  on  the  geometry  in  order  for 
this  keyword  to  work.  These  are  (a)  the  input  geometry  must  be  in 
cartesian  coordinates,  (b)  the  first  three  atoms  must  not  be  coaxial,  (c) 
triatomic  systems  are  not  allowed  (See  geometry  specification  -  triatomic 
systems  are  in  internal  coordinates,  by  definition.) 

Put  the  velocity  vector  after  the  geometry  as  three  data  per  line, 
representing  the  x,  y,  and  z  components  of  velocity  for  each  atom.  The 
units  of  velocity  are  centimeters  per  second. 

The  velocity  vector  will  be  rotated  so  as  to  suit  the  final 
cartesian  coordinate  orientation  of  the  molecule. 

If  KINETIC-n.n  is  also  specified,  the  velocity  vector  will  be  scaled 
to  equal  the  velocity  corresponding  to  n.n  kcal/mole.  This  allows  the 
user  to  define  the  direction  of  the  velocity  vector;  the  magnitude  is 
given  by  KINETIC=n.n. 
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WILLIAMS  (C) 

Within  the  ESP  calculation,  the  Connolly  surface  is  used  as  the 
default.  If  the  surface  generation  procedure  of  Donald  Williams  is 
wanted,  the  keyword  WILLIAMS  should  be  used. 


X-PRIORITY  (O) 

In  a  DRC  calculation,  results  will  be  printed  whenever  the 
calculated  geometry  changes  by  0.05  Angstroms.  The  geometry  change  is 
defined  as  the  linear  sum  of  the  translation  vectors  of  motion  for  all 
atoms  in  the  system.  Abbreviation,  X-PRIO. 


X-PRIORITY=n.nn  (O) 

In  a  DRC  calculation,  results  will  be  printed  whenever  the 
calculated  geometry  changes  by  n.nn  Angstroms. 


XYZ  (W) 

The  SADDLE  calculation  quite  often  fails  due  to  faulty  definition  of 
the  second  geometry  because  the  dihedrals  give  a  lot  of  difficulty.  To 
make  this  option  easier  to  use,  XYZ  was  developed.  A  calculation  using 
XYZ  runs  entirely  in  cartesian  coordinates,  thus  eliminating  the  problems 
associated  with  dihedrals.  The  connectivity  of  the  two  systems  can  be 
different,  but  the  numbering  must  be  the  same.  Dummy  atoms  can  be  used; 
these  will  be  removed  at  the  start  of  the  run.  A  new  numbering  system 
will  be  generated  by  the  program,  when  necessary. 

XYZ  is  also  useful  for  removing  dummy  atoms  from  an  internal 
coordinate  file;  use  XYZ  and  OSCF. 

If  a  large  ring  system  is  being  optimized,  sometimes  the  closure  is 
difficult,  in  which  case  XYZ  will  normally  work. 

Except  for  SADDLE,  do  not  use  XYZ  by  default:  use  it  only  when 

something  goes  wrong  1 

In  order  for  XYZ  to  be  used,  the  supplied  geometry  must  either  be  in 
cartesian  coordinates  or,  if  internal  coordinates  are  used,  symmetry  must 
not  be  used,  and  all  coordinates  must  be  flagged  for  optimization.  If 
dummy  atoms  are  present,  only  3N-6  coordinates  need  to  be  flagged  for 
optimization. 

If  at  all  possible,  the  first  3  atoms  should  be  real.  Except  in 
SADDLE,  XYZ  will  still  work  if  one  or  more  dummy  atoms  occur  before  the 
fourth  real  atom,  in  which  case  more  than  3N-6  coordinates  will  be 
flagged  for  optimization.  This  could  cause  difficulties  with  the  EF 
method,  which  is  why  dummy  atoms  at  the  start  of  the  geometry 
specification  should  be  avoided.  The  coordinates  to  be  optimized  depend 
on  the  internal  coordinate  definition  of  real  atoms  1,  2,  and  3.  If  the 
position  of  any  of  these  atoms  depends  on  dummy  atoms,  then  the 
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optimization  flags  will  be  different  from  the  case  where  the  first  three 
atoms  defined  are  all  real.  The  geometry  is  first  converted  to  cartesian 
coordinates  and  dummy  atoms  excluded.  The  cartesian  coordinates  to  be 
optimized  are: 


Atoms  R  R  R  R  R  X 

X  Y  Z  X  Y  Z 

Atom  1 

2  +  + 

3  +  +  +  +  + 

4  on  +  +  +  +  +  + 


RXR  XRR  R  X  X 

XYZ  XYZ  XYZ 

+  +  +  +  +  +  + 

+  +  +  +  +  +  +  +  + 

+  +  +  +  +  +  +  +  + 


X  R  X  X  X  R  XXX 

XYZ  XYZ  XYZ 

+  +  +  +  +  +  +  + 

+  +  +  +  +  +  +  +  + 

+  +  +  +  +  +  +  +  + 


Where  R  and  X  apply  to  real  and  dummy  atoms  in  the  internal  coordinate 
Z-matrixf  and  atoms  1,  2,  3,  and  4  are  the  real  atoms  in- cartesian 
coordinates.  A  '+'  means  that  the  relevant  coordinate  is  flagged  for 
optimization.  Note  that  the  number  of  flagged  coordinates  varies  from 
3N-6  to  3N-3,  atom  1  is  never  optimized. 


2.4  KEYWORDS  THAT  GO  TOGETHER 

Normally  only  a  subset  of  keywords  are  used  in  any  given  piece  of 
research.  Keywords  which  are  related  to  each  other  in  this  way  are: 

1.  In  getting  an  SCF:  SHIFT,  PULAY,  ITRY,  CAMP,  SCFCRT,  1SCF,  PL. 

2.  In  C.I.  work:  SINGLET,  DOUBLET,  etc.,  OPEN(n,m),  C.I.=(n,m), 
LARGE,  MECI,  MS=n,  VECTORS,  ESR,  ROOT=n,  MICROS. 

3.  In  excited  states:  UHF  with  (TRIPLET,  QUARTET,  etc.),  C.I.=n, 
C.I.=(n,m) . 

4.  In  geometry  optimization: 

1.  Using  BFGS:  GNORM=n.n,  XYZ,  PRECISE. 

2.  Using  EF:  GNORM=n . n ,  XYZ,  PRECISE 

3.  Using  NLLSQ:  GNORM=n. n,  XYZ,  PRECISE 

4.  Using  SIGMA:  GNORM=n.n,  XYZ,  PRECISE 

5.  In  Gaussian  work:  AIGIN,  AIGOUT,  AIDER. 

6.  In  SADDLE:  XYZ,  BAR=n . n 
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FORMAT:  The  geometry  is  read  in  using  essentially  "Free-Format"  of 
FORTRAN-77.  In  fact,  a  character  input  is  used  in  order  to  accommodate 
the  chemical  symbols,  but  the  numeric  data  can  be  regarded  as 
"f ree-format" .  This  means  that  integers  and  real  numbers  can  be 
interspersed,  numbers  can  be  separated  by  one  or  more  spaces,  a  tab 
and/or  by  one  comma.  If  a  number  is  not  specified,  its  value  is  set  to 
zero. 


The  geometry  can  be  defined  in  terms  of  either  internal  or  cartesian 
coordinates. 


3.1  INTERNAL  COORDINATE  DEFINITION 

For  any  one  atom  (i)  this  consists  of  an  interatomic  distance  in 
Angstroms  from  an  already-defined  atom  (j),  an  interatomic  angle  in 

degrees  between  atoms  i  and  j  and  an  already  defined  k,  (k  and  j  must  be 

different  atoms),  and  finally  a  torsional  angle  in  degrees  between  atoms 
if  j#  k,  and  an  already  defined  atom  1  (1  cannot  be  the  same  as  k  or  j). 
See  also  dihedral  angle  coherency. 

Exceptions: 

1.  Atom  1  has  no  coordinates  at  all:  this  is  the  origin. 

2.  Atom  2  must  be  connected  to  atom  1  by  an  interatomic  distance 

only. 

3.  Atom  3  can  be  connected  to  atom  1  or  2,  and  must  make  an  angle 

with  atom  2  or  1  (thus  -  3-2-1  or  3-1-2);  no  dihedral  is 

possible  for  atom  3.  By  default,  atom  3  is  connected  to  atom  2. 
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3.1.1  Constraints 


1.  Interatomic  distances  must  be  greater  than  zero.  Zero  Angstroms 
is  acceptable  only  if  the  parameter  is  symmetry-related  to 
another  atom,  and  is  the  dependent  function. 

2.  Angles  must  be  in  the  range  0.0  to  180.0,  inclusive.  This 
constraint  is  for  the  benefit  of  the  user  only;  negative  angles 
are  the  result  of  errors  in  the  construction  of  the  geometry, 
and  angles  greater  than  180  degrees  are  fruitful  sources  of 
errors  in  the  dihedrals. 

3.  Dihedrals  angles  must  be  definable.  If  atom  i  makes  a  dihedral 
with  atoms  j,  k,  and  1,  and  the  three  atoms  j,  k,  and  1  are  in  a 
straight  line,  then  the  dihedral  has  no  definable  angle.  During 
the  calculation  this  constraint  is  checked  continuously,  and  if 
atoms  j,  k,  and  1  lie  within  0.02  Angstroms  of  a  straight  line, 
the  calculation  will  output  an  error  message  and  then  stop.  Two 
exceptions  to  this  constraint  are: 

(a)  if  the  angle  is  zero  or  180  degrees,  in  which  case  the 
dihedral  is  not  used. 

(b)  if  atoms  j,  k,  and  1  lie  in  an  exactly  straight  line 
(usually  the  result  of  a  symmetry  constraint),  as  in  acetylene, 
acetonitrile,  but-2-yne,  etc. 


If  the  exceptions  are  used,  care  must  be  taken  to  ensure  that  the 
program  does  not  violate  these  constraints  during  any  optimizations  or 
during  any  calculations  of  derivatives  -  see  also  FORCE. 

Conversion  to  Cartesian  Coordinates 

By  definition,  atom  1  is  at  the  origin  of  cartesian  coordinate  space 
—  be  careful,  however,  if  atom  1  is  a  dummy  atom.  Atom  2  is  defined  as 
lying  on  the  positive  X  axis  —  for  atom  2,  Y=0  and  Z=0.  Atom  3  is  in 
the  X-Y  plane  unless  the  angle  3-2-1  is  exactly  0  or  180  degrees.  Atom 
4,  5,  6,  etc.  can  lie  anywhere  in  3-D  space. 


3 . 2  GAUSSIAN  Z-MATRICES 

With  certain  limitations,  geometries  can  now  be  specified  within 
MOPAC  using  the  Gaussian  Z-matrix  format. 


Exceptions  to  the  full  Gaussian  standard 

1.  The  option  of  defining  an  atom's  position  by  one  distance  and 
two  angles  is  not  allowed.  In  other  words,  the  N4  variable 
described  in  the  Gaussian  manual  must  either  be  zero  or  not 
specified.  MOPAC  requires  the  geometry  of  atoms  to  be  defined 
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in  terms  of,  at  most,  one  distance,  one  angle,  and  one  dihedral. 

2.  Gaussian  cartesian  coordinates  are  not  supported. 

3.  Chemical  symbols  must  not  be  followed  by  an  integer  identifying 
the  atom.  Numbers  after  a  symbol  are  used  by  MOPAC  to  indicate 
isotopic  mass.  If  labels  are  desired,  they  should  be  enclosed 
in  parentheses,  thus  "Cl(on  C5)34. 96885". 

4.  The  connectivity  (Nl,  N2,  N3)  must  be  integers.  Labels  are  not 
allowed. 


Specification  of  Gaussian  Z-matrices. 

The  information  contained  in  the  Gaussian  Z-matrix  is  identical  to 
that  in  a  MOPAC  Z-matrix.  The  order  of  presentation  is  different.  Atom 
N,  (real  or  dummy)  is  specified  in  the  format: 

Element  Nl  Length  N2  Alpha  N3  Beta 
where  Element  is  the  same  as  for  the  MOPAC  Z-matrix.  Nl,  N2,  and  N3  are 
the  connectivity,  the  same  as  the  MOPAC  Z-matrix  NA,  NB,  and  NC:  bond 
lengths  are  between  N  and  Nl,  angles  are  between  N,  Nl  and  N2,  and 
dihedrals  are  between  N,  Nl,  N2,  and  N3.  The  same  rules  apply  to  Nl,  N2, 
and  N3  as  to  NA,  NB,  and  NC. 

Length,  Alpha,  and  Beta  are  the  bond  lengths,  the  angle,  and 

dihedral.  They  can  be  'real',  e.g.  1.45,  109.4,  180.0,  or  'symbolic'. 
A  symbolic  is  an  alphanumeric  string  of  up  to  8  characters,  e.g.  R51, 
A512,  D5213,  CH,  CHO,  CHOC,  etc.  Two  or  more  symbolics  can  be  the  same. 

Dihedral  symbolics  can  optionally  be  preceeded  by  a  minus  sign,  in  which 

case  the  value  of  the  dihedral  is  the  negative  of  the  value  of  the 
symbolic.  This  is  the  equivalent  of  the  normal  MOPAC  SYMMETRY  operations 
1,  2,  3,  and  14. 

If  an  internal  coordinate  is  real,  it  will  not  be  optimized.  This 
is  the  equivalent  of  the  MOPAC  optimization  flag  "0".  If  an  internal 
coordinate  is  symbolic,  it  can  be  optimized. 

The  Z-matrix  is  terminated  by  a  blank  line,  after  which  comes  the 
starting  values  of  the  symbolics,  one  per  line.  If  there  is  a  blank  line 
in  this  set,  then  all  symbolics  after  the  blank  line  are  considered 
fixed;  that  is,  they  will  not  be  optimized.  The  set  before  the  blank 
line  will  be  optimized. 
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Example  of  Gaussian  Z-matrix  geometry  specification 

Line  1  AMI 
Line  2  Ethane 
Line  3 
Line  4  C 


Line 

5 

C 

1 

r21 

Line 

6 

H 

2 

r32 

1 

a321 

Line 

7 

H 

2 

r32 

1 

a321 

3 

<34  213 

tf 

Line 

8 

H 

2 

r32 

1 

a321 

3 

-d4213 

Line 

9 

H 

1 

r  32 

2 

a321 

3 

60. 

Line 

10 

H 

1 

r32 

2 

a321 

3 

180. 

Line 

11 

H 

1 

r32 

2 

a321 

3 

d300 

% 

Line 

12 

Line 

13 

r21 

1.5 

Line 

14 

r32 

1.1 

Line 

15 

a321 

109.5 

Line 

16 

d4313 

120.0 

Line 

17 

Line 

18 

d300 

300.0 

Line  19 


3.3  CARTESIAN  COORDINATE  DEFINITION 

A  definition  of  geometry  in  cartesian  coordinates  consists  of  the 
chemical  symbol  or  atomic  number,  followed  by  the  cartesian  coordinates 
and  optimization  flags  but  no  connectivity. 

MOPAC  uses  the  lack  of  connectivity  to  indicate  that  cartesian 
coordinates  are  to  be  used.  A  unique  case  is  the  triatomics  for  which 
only  internal  coordinates  are  allowed.  This  is  to  avoid  conflict  of 
definitions:  the  user  does  not  need  to  define  the  connectivity  of  atom 
2,  and  can  elect  to  use  the  default  connectivity  for  atom  3.  As  a 
result,  a  triatomic  may  have  no  explicit  connectivity  defined,  the  user 
thus  taking  advantage  of  the  default  connectivity.  Since  internal 
coordinates  are  more  commonly  used  than  cartesian,  the  above  choice  was 
made. 


If  the  keyword  XYZ  is  absent  every  coordinate  must  be  marked  for 
optimization.  If  any  coordinates  are  not  to  be  optimized,  the  keyword 
XYZ  must  be  present.  The  coordinates  of  all  atoms,  including  atoms  1,  2 
and  3  can  be  optimized.  Dummy  atoms  should  not  be  used,  for  obvious 
reasons. 


3.4  CONVERSION  BETWEEN  VARIOUS  FORMATS 

MOPAC  can  accept  any  of  the  following  formats:  cartesian,  MOPAC 
internal  coordinates,  and  Gaussian  internal  coordinates.  Both  MOPAC  and 
Gaussian  Z-matrices  can  also  contain  dummy  atoms.  Internally,  MOPAC 
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works  with  either  a  cartesian  coordinate  set  (if  XYZ  is  specified)  or 
internal  coordinates  (the  default).  If  the  OSCF  option  is  requested,  the 
geometry  defined  on  input  will  be  printed  in  MOPAC  Z-matrix  format,  along 
with  other  optional  formats. 

The  type(s)  of  geometry  printed  at  the  end  of  a  OSCF  calculation 
depend  only  on  the  keywords  XYZ,  AIGOUT,  and  NOXYZ.  The  geometry  printed 
is  independent  of  the  type  of  input  geometry,  and  therefore  makes  a 
convenient  conversion  mechanism. 

If  XYZ  is  present,  all  dummy  atoms  are  removed  and  the  internal 
coordinate  definition  remade.  All  symmetry  relations  are  lost  if  XYZ  is 
used. 

If  NOXYZ  is  present,  cartesian  coordinates  will  not  be  printed. 

If  AIGOUT  is  present,  a  data  set  using  Gaussian  Z-matrix  format  is 
printed. 

Note:  (1)  Only  if  the  keyword  XYZ  is  absent  and  the  keyword 
SYMMETRY  present  in  a  MOPAC  internal  coordinate  geometry,  or  two  or  more 
internal  coordinates  in  a  Gaussian  Z-matrix  have  the  same  symbolic  will 
symmetry  be  present  in  the  MOPAC  or  Gaussian  geometries  output.  (2)  This 
expanded  use  of  OSCF  replaces  the  program  DDUM,  supplied  with  earlier 
copies  of  MOPAC. 


3.5  DEFINITION  OF  ELEMENTS  AND  ISOTOPES 

Elements  are  defined  in  terms  of  their  atomic  numbers  or  their 
chemical  symbols,  case  insensitive.  Thus,  chlorine  could  be  specified  as 
17,  or  Cl.  In  Version  6,  only  main-group  elements  and  transition  metals 
for  which  the  'd*  shell  is  full  are  available. 

Acceptable  symbols  for  MNDO  are: 

Elements  Dummy  atom,  sparkles  and 

Translation  Vector 
H 


Li 

* 

B 

C 

N 

0 

F 

Na‘ 

* 

A1 

Si 

P 

s 

Cl 

+ 

o 

K' 

*  ... 

Zn 

* 

Ge 

* 

* 

Br 

XX  Cb 

++  +  —  -  Tv 

Rb' 

*  ... 

* 

* 

Sn 

* 

* 

I 

99  102 

103  104  105  106  107 

* 

*  ... 

Hg 

* 

Pb 

* 

1  These  symbols  refer  to  elements  which  lack  a  basis  set. 

+  This  is  the  dummy  atom  for  assisting  with  geometry  specification. 

*  Element  not  parameterized. 

o  This  is  the  translation  vector  for  use  with  polymers. 

Old  parameters  for  some  elements  are  available.  These  are  provided 
to  allow  compatibility  with  earlier  copies  of  MOPAC.  To  use  these  older 
parameters,  use  a  keyword  composed  of  the  chemical  symbol  followed  by  the 
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year  of  publication  of  the  parameters.  Keywords  currently  available: 
Sil978  S1978. 


For 

AMI 

,  acceptable 

symbols  are: 

Elements 

Dummy  atom,  sparkles  and 

Translation  Vector 

H 

* 

* 

B 

C 

N 

0 

F 

Na’ 

* 

A1 

Si 

P 

S 

Cl 

+  o 

K' 

* 

• 

. .  Zn  * 

Ge 

* 

* 

Br 

XX  Cb  ++  +  —  -  Tv 

Rb' 

* 

• 

*  * 

•  • 

Sn 

* 

* 

I 

99  102  103  104  105  106  107 

* 

* 

• 

.  .  Hg  * 

* 

* 

If  users  need  to  use  other  elements,  such  as  beryllium  or  lead,  they 
can  be  specified,  in  which  case  MNDO-type  atoms  will  be  used.  As  the 
behavior  of  such  systems  is  not  well  investigated,  users  are  cautioned  to 
exercise  unusual  care.  To  alert  users  to  this  situation,  the  keyword 
PARASOK  is  defined. 

For  PM3 ,  acceptable  symbols  are: 

Elements  Dummy  atom,  sparkles  and 

Translation  Vector 

H 


* 

Be 

* 

C 

N 

0 

F 

Na' 

Mg 

Al 

Si 

P 

S 

Cl 

+ 

o 

K' 

*  .  .  . 

Zn 

Ga 

Ge 

As 

Se 

Br 

XX  Cb 

++  +  —  -  Tv 

Rb' 

*  .  .  . 

Cd 

In 

Sn 

Sb 

Te 

I 

99  102 

103  104  105  106  107 

* 

*  ... 

Hg 

T1 

Pb 

Bi 

H 

B 

C 

N 

0 

F 

Si 

P 

S 

Cl 


Diatomics  Parameterized  within  the  MINDO/3  Formalism 


H 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


B 


* 

* 

* 

* 

* 

* 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


N 


0 


Si 


* 

* 


* 

* 


Cl 


* 

* 


* 

* 

* 


A  star  (*)  indicates 
that  the  atom-pair  is 
parameterized  within 
MINDO/3. 


Note:  MINDO/3  should  now  be  regarded  as  being  of  historical 
interest  only.  MOPAC  contains  the  original  parameters.  These  do  not 
reproduce  the  original  reported  results  in  the  case  of  P,  Si,  or  S.  The 
original  work  was  faulty,  see  G.  Frenking,  H.  Goetz,  and  F.  Marschner, 
J.A.C.S.,  100,  5295  (1978).  Re-optimized  parameters  for  P-C  and  P-Cl 
were  derived  later  which  gave  better  results.  These  are: 
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Alpha (P-C) :  0.8700 
Beta(P-C) :  0.5000 
Alpha (P-Cl):  1.5400 
Beta(P-Cl) :  0.2800 


G.  Frenking,  H.  Goetz,  F.  Marschner, 
J.A.C.S.,  100,  5295-5296  (1978). 

G.  Frenking,  F.  Marschner,  H.  Goetz, 
Phosphorus  and  Sulfur,  8,  337-342  (1980). 


Although  better  than  the  original  parameters,  these  have  not  been 
adopted  within  MOPAC  because  to  do  so  at  this  time  would  prevent  earlier 
calculations  from  being  duplicated.  Parameters  for  P-0  and  P-F  have  been 
added:  these  were  abstracted  from  Frenking 's  1980  paper.  No 
inconsistency  is  involved  as  MINDO/3  historically  did  not  have  P-0  or  P-F 
parameters. 


Extra  entities  available  to  MNDO,  MINDO/3,  AMI  and  PM3 
+  A  100%  ionic  alkali  metal. 

++  A  100%  ionic  alkaline  earth  metal. 

A  100%  ionic  halogen-like  atom 
A  100%  ionic  group  Vl-like  atom. 

Cb  A  special  type  of  monovalent  atom 

Elements  103,  104,  105,  and  106  are  the  sparkles;  elements  11  and  19 
are  sparkles  tailored  to  look  like  the  alkaline  metal  ions;  Tv  is  the 
translation  vector  for  polymer  calculations.  See  "Full  description  of 
sparkles" . 


Element  102,  symbol  Cb,  is  designed  to  satisfy  valency  requirements 
of  atoms  for  which  some  bonds  are  not  completed.  Thus  in  "solid"  diamond 
the  usual  way  to  complete  the  normal  valency  in  a  cluster  model  is  to  use 
hydrogen  atoms.  This  approach  has  the  defect  that  the  electronegativity 
of  hydrogen  is  different  from  that  of  carbon.  The  "Capped  bond"  atom, 
Cb,  is  designed  to  satisfy  these  valency  requirements  without  acquiring  a 
net  charge. 

Cb  behaves  like  a  monovalent  atom,  with  the  exception  that  it  can 
alter  its  electronegativity  to  achieve  an  exactly  zero  charge  in  whatever 
environment  it  finds  itself.  It  is  thus  all  things  to  all  atoms.  On 
bonding  to  hydrogen  it  behaves  similar  to  a  hydrogen  atom.  On  bonding  to 
fluorine  it  behaves  like  a  very  electronegative  atom.  If  several  capped 
bond  atoms  are  used,  each  will  behave  independently.  Thus  if  the  two 
hydrogen  atoms  in  formic  acid  were  replaced  by  Cb's  then  each  Cb  would 
independently  become  electroneutral. 

Capped  bonds  internal  coordinates  should  not  be  optimized.  A  fixed 
bond-length  of  1.7  Angstroms  is  recommended,  if  two  Cb  are  on  one  atom,  a 
contained  angle  of  109.471221  degrees  is  suggested,  and  if  three  Cb  are 
on  one  atom,  a  contained  dihedral  of  -120  degrees  (note  sign)  should  be 
used. 


Element  99,  X,  or  XX  is  known  as  a  dummy  atom,  and  is  used  in  the 
definition  of  the  geometry;  it  is  deleted  automatically  from  any 
cartesian  coordinate  geometry  files.  Dummy  atoms  are  pure  mathematic 
points,  and  are  useful  in  defining  geometries;  for  example,  in  ammonia 
the  definition  of  C3v  symmetry  is  facilitated  by  using  one  dummy  atom  and 
symmetry  relating  the  three  hydrogens  to  it. 
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Output  normally  only  gives  chemical  symbols. 

Isotopes  are  used  in  conjunction  with  chemical  symbols.  If  no 
isotope  is  specified,  the  average  isotopic  mass  is  used,  thus  chlorine  is 
35.453.  This  is  different  from  some  earlier  versions  of  MOPAC,  in  which 
the  most  abundant  isotope  was  used  by  default.  This  change  was  justified 
by  the  removal  of  any  ambiguity  in  the  choice  of  isotope.  Also,  the 
experimental  vibrational  spectra  involve  a  mixture  of  isotopes.  If  a 
user  wishes  to  specify  any  specific  isotope  it  should  immediately  follow 
the  chemical  symbol  (no  space),  e.g.,  H2,  H2.0140,  C(meta)13,  or 
C13. 00335. 


The  sparkles  ++,  +,  — ,  and  -  have  no  mass;  if  they  are  to  be  used 
in  a  force  calculation,  then  appropriate  masses  should  be  used. 


Each  internal  coordinate  is  followed  by  an  integer,  to  indicate  the 
action  to  be  taken. 

Integer  Action 


1 

0 

-1 


Optimize  the  internal  coordinate. 

Do  not  optimize  the  internal  coordinate. 
Reaction  coordinate,  or  grid  index. 


Remarks: 


Only  one  reaction  coordinate  is  allowed,  but  this  can  be  made  more 
versatile  by  the  use  of  SYMMETRY.  If  a  reaction  coordinate  is  used,  the 
values  of  the  reaction  coordinate  should  follow  immediately  after  the 
geometry  and  any  symmetry  data.  No  terminator  is  required,  and 
free-format-type  input  is  acceptable. 

If  two  "reaction  coordinates"  are  used,  then  MOPAC  assumes  that  the 
two-dimensional  space  in  the  region  of  the  supplied  geometry  is  to  be 
mapped.  The  two  dimensions  to  be  mapped  are  in  the  plane  defined  by  the 
" — 1 "  labels.  Step  sizes  in  the  two  directions  must  be  supplied  using 
STEP1  and  STEP 2  on  the  keyword  line. 

Using  internal  coordinates,  the  first  atom  has  three  unoptimizable 
coordinates,  the  second  atom  two,  (the  bond-length  can  be  optimized)  and 
the  third  atom  has  one  unoptimizable  coordinate.  None  of  these  six 
unoptimizable  coordinates  at  the  start  of  the  geometry  should  be  marked 
for  optimization.  If  any  are  so  marked,  a  warning  is  given,  but  the 
calculation  will  continue. 

In  cartesian  coordinates  all  parameters  can  be  optimized. 


3.6  EXAMPLES  OF  COORDINATE  DEFINITIONS. 

Two  examples  will  be  given.  The  first  is  formic  acid,  HCOOH,  and  is 
presented  in  the  normal  style  with  internal  coordinates.  This  is 
followed  by  formaldehyde,  presented  in  such  a  manner  as  to  demonstrate  as 
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many  different  features  of  the  geometry  definition  as  possible. 

MINDO/3 
Formic  acid 

Example  of  normal  geometry  definition 

O  Atom  1  needs  no  coordinates. 


c 

1.20 

1 

Atom 

2  bonds  to  atom  1. 

0 

1.32 

1 

116.8 

1 

2 

1 

Atom 

3  bonds  to  atom  2  and 

makes 

i  an  angle  with  atom  1. 

H 

0.98 

1 

123.9 

1 

o 

• 

o 

0 

3 

2 

1 

Atom 

4  has  a  dihedral  of  0.0 

with 

atoms  3,  2  and  1. 

H 

1.11 

1 

127.3 

1 

180.0 

0 

2 

1 

3 

0 

0.00 

0 

0.0 

0 

0.0 

0 

0 

0 

0 

Atom  2,  a  carbon,  is  bonded  to  oxygen  by  a  bond-length  of  1.20 
Angstroms,  and  to  atom  3,  an  oxygen,  by  a  bond-length  of  1.32  Angstroms. 
The  O-C-O  angle  is  116.8  degrees.  The  first  hydrogen  is  bonded  to  the 
hydroxyl  oxygen  and  the  second  hydrogen  is  bonded  to  the  carbon  atom. 
The  H-C-0-0  dihedral  angle  is  180  degrees. 


S 


MOPAC  can  generate  data-files,  both  in  the  Archive  files,  and  at 
the  end  of  the  normal  output  file,  when  a  job  ends  prematurely  due  to 
time  restrictions.  Note  that  the  data  are  all  neatly  lined  up.  This 
is,  of  course,  characteristic  of  machine-generated  data,  but  is  useful 
when  checking  for  errors. 
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Format  of  internal  coordinates  in  ARCHIVE  file 


O 

C 

o 

H 

H 

0 


0.000000  0 
1.209615  1 
1.313679  1 
0.964468  1 
1.108040  1 
0.000000  0 


0.000000  0 
0.000000  0 
116.886168  1 
115.553316  1 
128.726078  1 
0.000000  0 


0.000000  0 

0.000000  0 

0.000000  0 

0.000000  0 

180.000000  0 

0.000000  0 


0  0  0 

10  0 
2  10 

3  2  1 

2  13 

0  0  0 


Polymers  are  defined  by  the  presence  of  a  translation  vector.  In 
the  following  example,  polyethylene,  the  translation  vector  spans  three 
monomeric  units,  and  is  7.7  Angstroms  long.  Note  in  this  example  the 
presence  of  two  dummy  atoms.  These  not  only  make  the  geometry 
definition  easier  but  also  allow  the  translation  vector  to  be  specified 
in  terms  of  distance  only,  rather  than  both  distance  and  angles. 


Example  of  polymer  coordinates  from  ARCHIVE  file 


T=20000 

POLYETHYLENE,  CLUSTER  UNIT  :  C6H12 


C  0.000000 

C  1.540714 

C  1.542585 

C  1.542988 

C  1.545151 

C  1.541777 

XX  1.542344 

XX  1.540749 

H  1.114786 

H  1.114512 

H  1.114687 

H  1.114748 

H  1.114474 

H  1.114433 

H  1.114308 

H  1.114434 

H  1.114534 

H  1.114557 

H  1.114734 

H  1.115150 

Tv  7.746928 

0  0.000000 


0  0.000000 

1  0.000000 

1  113.532306 

1  113.373490 

1  113.447508 

1  113.859804 

1  108.897076 

1  108.360151 

1  90.070026 

1  90.053136 

1  90.032722 

1  89.975504 

1  90.063308 

1  89.915262 

1  90.028131 

1  90.189506 

1  88.522263 

1  88.707407 

1  90.638631 

1  91.747016 

1  0.000000 

0  0.000000 


0  0.000000 
0  0.000000 
1  0.000000 
1  179.823613 

1  179.811764 

1  -179.862648 
1  -179.732346 
1  -178.950271 
1  126.747447 

1  -127.134856 
1  126.717889 

1  -127.034513 
1  126.681098 

1  -126.931090 
1  127.007845 

1  -126.759550 
1  127.041363 

1  -126.716355 
1  127.793055 

1  -126.187496 
0  0.000000 
0  0.000000 


0  0  0  0 

0  10  0 

0  2  10 

1  3  2  1 

1  4  3  2 

1  5  4  3 

1  6  5  4 

1  7  6  5 

1  13  2 

1  13  2 

1  2  4  3 

1  2  4  3 

1  3  5  4 

1  3  5  4 

1  4  6  5 

1  4  6  5 

1  5  7  6 

1  5  7  6 

1  6  8  7 

1  6  8  7 

0  17  8 

0  0  0  0 
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EXAMPLES 


In  this  chapter  various  examples  of  data-files  are  described.  With 
MOPAC  comes  two  sets  of  data  for  running  calculations.  One  of  these  is 
called  MNRSD1.DAT,  and  this  will  now  be  described. 


4.1  MNRSD1  TEST  DATA  FILE  FOR  FORMALDEHYDE 

The  following  file  is  suitable  for  generating  the  results  described 
in  the  next  section,  and  would  be  suitable  for  debugging  data. 


Line 

Is 

SYMMETRY 

Line 

2: 

Formaldehyde,  for  Demonstration  Purposes 

Line 

3: 

Line 

4: 

0 

Line 

5: 

C  1.2  1 

Line 

6: 

H  1.1  1  120  1 

Line 

7s 

H  1.1  0  120  0  180  0  2  1  3 

Line 

8: 

Line 

9: 

.  1  4 

Line 

10: 

3  2  4 

Line 

11: 
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These  data  could  be  more  neatly  written  as 


Line 

1: 

SYMMETRY 

Line 

2: 

Formaldehyde, 

for  Demonstration  Purposes 

Line 

3: 

Line 

4: 

0 

Line 

5: 

C 

1.20 

1 

1 

Line 

6: 

B 

1.10 

1 

120.00  1  21 

Line 

7: 

B 

1.10 

0 

120.00  0  180.00  021 

Line 

8: 

Line 

9: 

3, 

1, 

4, 

Line 

10: 

3, 

2, 

4, 

Line 

11: 

These  two  data-files  will  produce  identical  results  fiJtes. 

In  all  geometric  specifications,  care  must  be  taken  in  defining  the 
internal  coordinates  to  ensure  that  no  three  atoms  being  used  to  define 
a  fourth  atom's  dihedral  angle  ever  fall  into  a  straight  line.  This  can 
happen  in  the  course  of  a  geometry  optimization,  in  a  SADDLE  calculation 
or  in  following  a  reaction  coordinate.  If  such  a  condition  should 
develop,  then  the  position  of  the  dependent  atom  would  become 
ill-defined. 


4.2  MOPAC  OUTPUT  FOR  TEST-DATA  FILE  MNRSD1 

**************************************************************************** 

**  FRANK  J.  SEILER  RES.  LAB.,  U.S.  AIR  FORCE  ACADEMY,  COLO.  SPGS.,  CO.  80840 
**************************************************************************** 

MNDO  CALCULATION  RESULTS  Note  1 


**************************************************************************** 

*  MOPAC:  VERSION  6.00  CALC'D.  4-OCT-90  Note  2 

*  SYMMETRY  -  SYMMETRY  CONDITIONS  TO  BE  IMPOSED 

*  T=  -  A  TIME  OF  3600.0  SECONDS  REQUESTED 

*  DUMP-N  -  RESTART  FILE  WRITTEN  EVERY  3600.0  SECONDS 

**************************************** ****************************043BY043 


PARAMETER  DEPENDENCE  DATA 

REFERENCE  ATOM  FUNCTION  NO.  DEPENDENT  ATOM(S) 

3  14 

3  2  4 


62 


EXAMINES 


Page  4-3 


DESCRIPTIONS  OF  THE  FUNCTIONS  USED 

1  BOND  LENGTH  IS  SET  EQUAL  TO  THE  REFERENCE  BOND  LENGTH 

2  BOND  ANGLE  IS  SET  EQUAL  TO  THE  REFERENCE  BOND  ANGLE 

SYMMETRY  Note  3 

Formaldehyde/  for  Demonstration  Purposes 


ATOM 

CHEMICAL 

BOND  LENGTH 

BOND  ANGLE 

TWIST  ANGLE 

NUMBER 

SYMBOL 

(ANGSTROMS) 

(DEGREES) 

(DEGREES) 

(I) 

NA:  I 

NB : NA : I 

NC : NB : NA : I 

NA 

NB  NC 

1 

O 

Note  4 

2 

C 

1.20000  * 

1 

3 

H 

1.10000  * 

120.00000  * 

2 

1 

4 

H 

1.10000 

120.00000 

180.00000 

2 

1  3 

CARTESIAN  COORDINATES 


NO. 

ATOM 

X 

Y 

Z 

1 

O 

0.0000 

0.0000 

0.0000 

2 

C 

1.2000 

0.0000 

0.0000 

Note 

3 

H 

1.7500 

0.9526 

0.0000 

4 

H 

1.7500 

-0.9526 

0.0000 

H:  (MNDO) : 

M.J.S. 

DEWAR,  W. 

THIEL,  J. 

AM.  CHEM. 

SOC., 

99,  4899, 

(1977) 

C:  (MNDO): 

M.J.S. 

DEWAR,  W. 

THIEL,  J. 

AM.  CHEM. 

SOC., 

99,  4899, 

(1977) 

O:  (MNDO): 

M.J.S. 

DEWAR,  W. 

THIEL,  J. 

AM.  CHEM. 

SOC., 

99,  4899, 

(1977) 

RHF 

CALCULATION,  NO. 

OF  DOUBLY 

OCCUPIED 

LEVELS 

=  6 

INTERATOMIC  DISTANCES 

01  C  2  H  3  H  4 


O  1  0.000000 

C  2  1.200000  0.000000 

H  3  1.992486  1.100000 

H  4  1.992486  1.100000 

CYCLE:  1  TIME:  0.75  TIME  LEFT: 

CYCLE:  2  TIME:  0.37  TIME  LEFT: 

HEAT  OF  FORMATION  TEST  SATISFIED 
PETERS  TEST  SATISFIED 


0.000000 

1.905256  0.000000 

3598.2  GRAD. : 
3597.8  GRAD.: 


Note  6  | 

6.349  HEAT:-32. 840147 
2.541  HEAT:-32. 880103 
Note  7 
Note  8 


SYMMETRY 

Formaldehyde/  for  Demonstration  Purposes 


Note  9 
Note  10 


PETERS  TEST  WAS  SATISFIED  IN  BFGS 
SCF  FIELD  WAS  ACHIEVED 
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UNDO  CALCULATION  INote  13 

VERSION  6.00 
4-OCT-90 


PINAL  HEAT  OF  FORMATION  =  -32.88176  KCAL  Note  14 


TOTAL  ENERGY  =  -478.11917  EV 

ELECTRONIC  ENERGY  =  -870.69649  EV 

CORE-CORE  REPULSION  =  392.57733  EV 

IONIZATION  POTENTIAL  »  11.04198 

NO.  OF  FILLED  LEVELS  *  6 

MOLECULAR  HEIGHT  =  30.026 


SCF  CALCULATIONS  =  15 

COMPUTATION  TIME  =  2.740  SECONDS  Note  15 


ATOM  CHEMICAL  BOND  LENGTH  BOND  ANGLE  TWIST  ANGLE 
NUMBER  SYMBOL  (ANGSTROMS)  (DEGREES)  (DEGREES) 

(I)  NA:  I  NB : NA :  I  NC:NB:NA:I  NA  NB  NC 

1  O 

2  C  1.21678  *  1  Note  16 

3  H  1.10590  *  123.50259  *  21 

4  H  1.10590  123.50259  180.00000  213 


INTERATOMIC  DISTANCES 

01  C  2  H  3  H  4 


O  1  0.000000 

C  2  1.216777  0.000000 

H  3  2.046722  1.105900  0.000000 

H  4  2.046722  1.105900  1.844333  0.000000 


64 


EXAMPLES 


Page  4-5 


EIGENVALUES 


-42.98352  -25.12201  -16.95327  -16.29819  -14.17549  -11.04198  0.85804  3.6768 

3.84990  7.12408  Note  17 


NET  ATOMIC  CHARGES  AND  DIPOLE  CONTRIBUTIONS 


ATOM 

NO.  TYPE 

CHARGE 

ATOM  ELECTRON  DENSITY 

1 

O 

-0.2903 

6.2903 

2 

C 

0.2921 

3.7079  Note 

18 

3 

H 

-0.0009 

1.0009 

4 

H 

-0.0009 

1.0009 

DIPOLE 

X 

Y 

Z 

TOTAL 

POINT-CHG. 

1.692 

0.000 

0.000 

1.692 

HYBRID 

0.475 

0.000 

0.000 

0.475  Note 

19 

SUM 

2.166 

0.000 

0.000 

2.166 

NO. 

CARTESIAN  COORDINATES 

ATOM 

X 

Y 

Z 

1 

O 

0.0000 

0.0000 

0.0000 

2 

C 

1.2168 

0.0000 

0.0000 

3 

H 

1.8272 

0.9222 

0.0000 

4 

H 

1.8272 

-0.9222 

0.0000 

ATOMIC  ORBITAL  ELECTRON  POPULATIONS 

1.88270  1.21586  1.89126  1.30050  1.25532  0.86217  0.89095  0.69950 

1.00087  1.00087  Note  20 


TOTAL  CPU  TIME:  3.11  SECONDS 

==  MOPAC  DONE  == 

NOTES  ON  RESULTS  FILE 

NOTE  1:  The  banner  indicates  whether  the  calculation  uses  a  MNDO, 
MINDO/3 ,  AMI  or  PM3  Hamiltonian;  here,  the  default  MNDO  Hamiltonian  is 
used. 


NOTE  2:  The  Version  number  is  a  constant  for  any  release  of  MOPAC, 
and  refers  to  the  program,  not  to  the  Hamiltonians  used.  The  version 
number  should  be  cited  in  any  correspondence  regarding  MOPAC.  Users' 
own  in-house  modified  versions  of  MOPAC  will  have  a  final  digit 
different  from  zero,  e.g.  6.01. 

All  the  keywords  used,  along  with  a  brief  explanation,  should  be 
printed  at  this  time.  If  a  keyword  is  not  printed,  it  has  not  been 
recognized  by  the  program.  Keywords  can  be  in  upper  or  lower  case 
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letters#  or  any  mixture.  The  cryptic  message  at  the  right  end  of  the 
lower  line  of  asterisks  indicates  the  number  of  heavy  and  light  atoms 
this  version  of  MOPAC  is  configured  for. 

NOTE  3:  Symmetry  information  is  output  to  allow  the  user  to  verify 
that  the  requested  symmetry  functions  have  in  fact  been  recognized  and 
used. 


NOTE  4:  The  data  for  this  example  used  a  mixture  of  atomic  numbers 
and  chemical  symbols#  but  the  internal  coordinate  output  is  consistently 
in  chemical  symbols. 

The  atoms  in  the  system  are,  in  order: 

Atom  1,  an  oxygen  atom;  this  is  defined  as  being  at  the 
origin. 

Atom  2,  the  carbon  atom.  Defined  as  being  1.2  Angstroms 
from  the  oxygen  atom#  it  is  located  in  the  +x  direction.  This 
distance  is  marked  for  optimization. 

Atom  3#  a  hydrogen  atom.  it  is  defined  as  being  1.1 
Angstroms  from  the  carbon  atom#  and  making  an  angle  of  120 
degrees  with  the  oxygen  atom.  The  asterisks  indicate  that  the 
bond  length  and  angle  are  both  to  be  optimized. 

Atom  4,  a  hydrogen  atom.  The  bond  length  supplied  has  been 
overwritten  with  the  symmetry-defined  C-H  bond  length.  Atom  4  is 
defined  as  being  1.1  Angstroms  from  atom  2,  making  a  bond-angle 
of  120  degrees  with  atom  1,  and  a  dihedral  angle  of  180  degrees 
with  atom  3. 

None  of  the  coordinates  of  atom  4  are  marked  for 
optimization.  The  bond-length  and  angle  are  symmetry-defined  by 
atom  3#  and  the  dihedral  is  group-theory  symmetry-defined  as 
being  180  degrees.  (The  molecule  is  flat.) 

NOTE  5:  The  cartesian  coordinates  are  calculated  as  follows: 

Stage  1:  The  coordinate  of  the  first  atom  is  defined  as 
being  at  the  origin  of  cartesian  space#  while  the  coordinate  of 
the  second  atom  is  defined  as  being  displaced  by  its  defined  bond 
length  along  the  positive  x-axis.  The  coordinate  of  the  third 
atom  is  defined  as  being  displaced  by  its  bond  length  in  the  x-y 
plane#  from  either  atom  1  or  2  as  defined  in  the  data#  or  from 
atom  2  if  no  numbering  is  given.  The  angle  it  makes  with  atoms  1 
and  2  is  that  given  by  its  bond  angle. 

The  dihedral,  which  first  appears  in  the  fourth  atom,  is 
defined  according  to  the  I.U.P.A.C.  convention.  NOTE:  This  is 
different  from  previous  versions  of  MNDO  and  MINDO/3#  where  the 
dihedral  had  the  opposite  chirality  to  that  defined  by  the 
I.U.P.A.C.  convention. 
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Stage  2:  Any  dummy  atoms  are  removed.  As  this  particular 
system  contains  no  dummy  atoms,  nothing  is  done. 

NOTE  6:  The  interatomic  distances  are  output  for  the  user's 
advice,  and  a  simple  check  made  to  insure  that  the  smallest  interatomic 
distance  is  greater  than  0.8  Angstroms. 

NOTE  7:  The  geometry  is  optimized  in  a  series  of  cycles,  each 
cycle  consisting  of  a  line  search  and  calculation  of  the  gradients.  The 
time  given  is  the  C.P.U.  time  for  the  cycle;  time  left  is  the  total 
time  requested  (here  100  seconds)  less  the  C.P.U.  time  since  the  start 
of  the  calculation  (which  is  earlier  than  the  start  of  the  first 
cycle!).  These  times  can  vary  slightly  from  cycle  to  cycle  due  to 
different  options  being  used,  for  example  whether  or  not  two  or  more  SCF 
calculations  need  to  be  done  to  ensure  that  the  heat  of  formation  is 
lowered.  The  gradient  is  the  scalar  length  in  kcal/mole/Angstrom  of  the 
gradient  vector. 

NOTE  8:  At  the  end  of  the  BFGS  geometry  optimization  a  message  is 
given  which  indicates  how  the  optimization  ended.  All  "normal" 
termination  messages  contain  the  word  "satisfied";  other  terminations 
may  give  acceptable  results,  but  more  care  should  be  taken,  particularly 
regarding  the  gradient  vector. 

NOTE  9  and  10:  The  keywords  used,  titles  and  comments  are 

reproduced  here  to  remind  the  user  of  the  name  of  the  calculation. 

NOTES  11  and  12:  Two  messages  are  given  here.  The  first  is  a 
reminder  of  how  the  geometry  was  obtained,  whether  from  the 
|  Broyden-Fletcher-Goldfarb-Shanno,  Eigenvector  Following,  Bartel's  or  the 
Mclver-Komornicki  methods.  For  any  further  results  to  be  printed  the 
second  message  must  be  as  shown;  when  no  SCF  is  obtained  no  results  will 
be  printed. 

NOTE  13:  Again,  the  results  are  headed  with  either  MNDO  or  MINDO/3 
banners,  and  the  version  number.  The  date  has  been  moved  to  below  the 
version  number  for  convenience. 

NOTE  14:  The  total  energy  of  the  system  is  the  addition  of  the 
electronic  and  nuclear  terms.  The  heat  of  formation  is  relative  to  the 
elements  in  their  standard  state.  The  I.P.  is  the  negative  of  the 
energy  level  of  the  highest  occupied,  or  highest  partially  occupied 
molecular  orbital  (in  accordance  with  Koopmans'  theorem). 

NOTE  15:  Advice  on  time  required  for  the  calculation.  This  is 
obviously  useful  in  estimating  the  times  required  for  other  systems. 

NOTE  16:  The  fully  optimized  geometry  is  printed  here.  If  a 
parameter  is  not  marked  for  optimization,  it  will  not  be  changed  unless 
it  is  a  symmetry-related  parameter. 

NOTE  17:  The  roots  are  the  eigenvalues  or  energy  levels  in 
electron  volts  of  the  molecular  orbitals.  There  are  six  filled  levels, 
therefore  the  HOMO  has  an  energy  of  -11.041eV;  analysis  of  the 
corresponding  eigenvector  (not  given  here)  shows  that  it  is  mainly 
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lone-pair  on  oxygen.  The  eigenvectors  form  an  orthonormal  set. 

MOTE  18:  The  charge  on  an  atom  is  the  sum  of  the  positive  core 
charge;  for  hydrogen,  carbon,  and  oxygen  these  numbers  are  1.0,  4.0,  and 
6.0,  respectively,  and  the  negative  of  the  number  of  valence  electrons, 
or  atom  electron  density  on  the  atom,  here  1.0010,  3.7079,  and  6.2902 
respectively. 

NOTE  19:  The  dipole  is  the  scalar  of  the  dipole  vector  in 

cartesian  coordinates.  The  components  of  the  vector  coefficients  are 
the  point-charge  dipole  and  the  hybridization  dipole.  In  formaldehyde 
there  is  no  z-dipole  since  the  molecule  is  flat. 

NOTE  20:  MNDO  AMI,  PM3,  and  MINDO/3  all  use  the  Coulson  density 

matrix.  Only  the  diagonal  elements  of  the  matrix,  representing  the 

valence  orbital  electron  populations,  will  be  printed,  unless  the 
keyword  DENSITY  is  specified. 

Two  extra  lines  are  added  as  a  result  of  user  requests: 

(1)  The  total  CPU  time  for  the  job,  excluding  loading  of  the 
executable,  is  printed. 

(2)  In  order  to  know  that  MOPAC  has  ended,  the  message 

==  MOPAC  DONE  ==  is  printed. 
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TESTDATA.DAT,  supplied  with  MOPAC  6.00,  is  a  single  large  job 
consisting  of  several  small  systems,  which  are  run  one  after  the  other, 
[n  order,  the  calculations  run  are 

1.  A  FORCE  calculation  on  formaldehyde.  The  extra  keywords  at  the 
start  are  to  be  used  later  when  TESTDATA.DAT  acts  as  a  SETUP 
file.  This  unusual  usage  of  a  data  set  was  made  necessary  by 
the  need  to  ensure  that  a  SETUP  file  existed.  If  the  first  two 
lines  are  removed,  the  data  set  used  in  the  example  given  below 
is  generated. 

2.  The  vibrational  frequencies  of  a  highly  excited  dication  of 
methane  are  calculated.  A  non-degenerate  state  was  selected  in 
order  to  preserve  tetrahedral  symmetry  (to  avoid  the 
Jahn-Teller  effects). 

3.  Illustration  of  the  use  of  the  in  the  keyword  line,  and  of 
the  new  optional  definition  of  atoms  2  and  3 

4.  Illustration  of  Gaussian  Z-matrix  input. 

5.  An  example  of  Eigenvector  Following,  to  locate  a  transition 
state. 

6.  Use  of  SETUP.  Normally,  SETUP  would  point  to  a  special  file 
which  would  contain  keywords  only.  Here,  the  only  file  we  can 
guarantee  exists,  is  the  file  being  run,  so  that  is  the  one 
used. 

7.  Example  of  labelling  atoms. 

8.  This  part  of  the  test  writes  the  density  matrix  to  disk,  for 
later  use. 

9.  A  simple  calculation  on  water. 
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10.  The  previous,  optimized,  geometry  is  to  be  used  to  start  this 
calculation. 

11.  The  density  matrix  written  out  earlier  is  now  used  as  input  to 
start  an  SCF. 


This  example  is  taken  from  the  first  data-file  in  TESTDATA.DAT,  and 
illustrates  the  working  of  a  FORCE  calculation. 


5.1  DATA  FILE  FOR  A  FORCE  CALCULATION 

Line  1  nointer  noxyz  +  mndo  dump=8 

Line  2  t=2000  +  thermo( 298, 298)  force  isotope 

Line  3  ROT=2 

Line  4  DEMONSTRATION  OF  MOPAC  -  FORCE  AND  THERMODYNAMICS  CALCULATION 

Line  5  FORMALDEHYDE,  MNDO  ENERGY  =  -32.8819  See  Manual. 

Line  6  O 

Line  7  C  1.216487  1  100 

Line  8  H  1.106109  1  123.513310  1  210 

Line  9  H  1.106109  1  123.513310  1  180.000000  1  213 

Line  10  0  0.000000  0  0.000000  0  0.000000  0  000 


5.2  RESULTS  FILE  F(. k  THE  FORCE  CALCULATION 

**************************************************************************** 

**  FRANK  J.  SEILER  RES.  LAB.,  U.S.  AIR  FORCE  ACADEMY,  COLO.  SPGS.,  CO.  80840 
**************************************************************************** 

MNDO  CALCULATION  RESULTS 


MOPAC:  VERSION  6.00  CALC’D.  12-OCT-90 

T=  -  A  TIME  OF  2000.0  SECONDS  REQUESTED 

DUMP=N  -  RESTART  FILE  WRITTEN  EVERY  8.0  SECONDS 
FORCE  -  FORCE  CALCULATION  SPECIFIED 
PRECISE  -  CRITERIA  TO  BE  INCREASED  BY  100  TIMES 

NOINTER  -  INTERATOMIC  DISTANCES  NOT  TO  BE  PRINTED  Note  1 

ISOTOPE  -  FORCE  MATRIX  WRITTEN  TO  DISK  (CHAN.  9  ) 

NOXYZ  -  CARTESIAN  COORDINATES  NOT  TO  BE  PRINTED 
THERMO  -  THERMODYNAMIC  QUANTITIES  TO  BE  CALCULATED 
ROT  -  SYMMETRY  NUMBER  OF  2  SPECIFIED 

******************************************************************040By040 
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NOINTER  NOXYZ  +  MNDO  DUMP=8 
T=2000  +  THERMO( 298/298)  FORCE  ISOTOPE 
ROT=2  PRECISE 

DEMONSTRATION  OF  MOPAC  -  FORCE  AND  THERMODYNAMICS  CALCULATION 
FORMALDEHYDE,  MNDO  ENERGY  =  -32.8819  See  Manual. 

ATOM  CHEMICAL  BOND  LENGTH  BOND  ANGLE  TWIST  ANGLE 

NUMBER  SYMBOL  (ANGSTROMS)  (DEGREES)  (DEGREES) 

(I)  NA:  I  NB : NA : I  NC:NB;NA:I  NA  NB  NC 

ATOM  CHEMICAL  BOND  LENGTH  BOND  ANGLE  TWIST  ANGLE 

1  O 

2  C  1.21649  *  1 

3  H  1.10611  *  123.51331  *  21 

4  H  1.10611  *  123.51331  *  180.00000  *  2  1  3 

H:  (MNDO):  M.J.S.  DEWAR,  W.  THIEL,  J.  AM.  CHEM.  SOC.,  99,  4899,  (1977) 

C:  (MNDO):  M.J.S.  DEWAR,  W.  THIEL,  J.  AM.  CHEM.  SOC.,  99,  4899,  (1977) 

O:  (MNDO):  M.J.S.  DEWAR,  W.  THIEL,  J.  AM.  CHEM.  SOC.,  99,  4899,  (1977) 


RHF  CALCULATION,  NO.  OF  DOUBLY  OCCUPIED  LEVELS  =  6 


HEAT  OF  FORMATION  =  -32.881900  K CALS/MOLE 


INTERNAL  COORDINATE  DERIVATIVES 
ATOM  AT.  NO.  BOND  ANGLE  DIHEDRAL 

1  O 

2  C  0.000604 

3  H  0.000110  -0.000054 

4  H  0.000110  -0.000054  0.000000 


GRADIENT  NORM  =  0.00063  Note  2 


TIME  FOR  SCF  CALCULATION  =  0.45 


TIME  FOR  DERIVATIVES  =  0.32  Note  3 

MOLECULAR  WEIGHT  =  30.03 
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PRINCIPAL  MOMENTS  OF  INERTIA  IN  CM(-l) 

A  -  9.832732  B  =  1.261998  C  =  1.118449 


PRINCIPAL  MOMENTS  OF  INERTIA  IN  UNITS  OF  10**(-40)*GRAM-CM**2 
A  =  2.846883  B  =  22.181200  C  =  25.028083 


ORIENTATION  OF  MOLECULE  IN  FORCE  CALCULATION 
NO.  ATOM  X  Y  Z 

1  8  -0.6093  0.0000  0.0000 

2  6  0.6072  0.0000  0.0000 

3  1  1.2179  0.9222  0.0000 

4  1  1.2179  -0.9222  0.0000 


FIRST  DERIVATIVES  WILL  BE  USED  IN  THE  CALCULATION  OF  SECOND  DERIVATIVES 


ESTIMATED  TIME  TO  COMPLETE  CALCULATION  =  35.96  SECONDS 


STEP: 

1 

TIME  = 

2.15 

SECS, 

INTEGRAL 

2.15 

TIME 

LEFT: 

1997.08 

STEP: 

2 

TIME  = 

2.49 

SECS, 

INTEGRAL 

= 

4.64 

TIME 

LEFT: 

1994.59 

STEP: 

3 

TIME  = 

2.53 

SECS, 

INTEGRAL 

= 

7.17 

TIME 

LEFT: 

1992.06 

STEP: 

4 

TIME  = 

2.31 

SECS, 

INTEGRAL 

3 

9.48 

TIME 

LEFT: 

1989.75 

STEP: 

5 

RESTART 

FILE  WRITTEN, 

INTEGRAL 

= 

11.97 

TIME 

LEFT: 

1987.26 

STEP: 

6 

TIME  = 

2.43 

SECS, 

INTEGRAL 

= 

14.40 

TIME 

LEFT: 

1984.83 

STEP: 

7 

TIME  = 

2.32 

SECS, 

INTEGRAL 

= 

16.72 

TIME 

LEFT: 

1982.51 

STEP: 

8 

TIME  = 

2.30 

SECS, 

INTEGRAL 

19.02 

TIME 

LEFT: 

1980.21 

STEP: 

9 

RESTART 

FILE  WRITTEN, 

INTEGRAL 

= 

22.17 

TIME 

LEFT: 

1977.06 

STEP: 

10 

TIME  = 

2.52 

SECS, 

INTEGRAL 

= 

24.69 

TIME 

LEFT: 

1974.54 

STEP: 

11 

TIME  = 

2.25 

SECS, 

INTEGRAL 

= 

26.94 

TIME 

LEFT: 

1972.29 

STEP: 

12 

TIME  = 

3.15 

SECS, 

INTEGRAL 

= 

30.09 

TIME 

LEFT: 

1969.14 

0 


FORCE  MATRIX  IN  MILLIDYNES/ANGSTROM 

01  C  2  H  3  H  4 


O  1  9.557495 

C  2  8.682982  11.426823 

H  3  0.598857  2.553336  3.034881 

H  4  0.598862  2.553344  0.304463  3.034886 


HEAT  OF  FORMATION  =  -32.881900  KCALS/MOLE 


ZERO  POINT  ENERGY  18.002  KILOCALORIES  PER  MOLE  Note  4 
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THE  LAST  6  VIBRATIONS  ARE  THE  TRANSLATION  AND  ROTATION  MODES 

THE  FIRST  THREE  OF  THESE  BEING  TRANSLATIONS  IN  X,  Y,  AND  Z,  RESPECTIVELY 


NORMAL  COORDINATE  ANALYSIS 


Note  5 


ROOT  NO. 

1 

2 

3 

4 

5 

6 

1209.90331 

1214.67040 

1490.52685 

2114.53841 

3255.93651 

3302.12319 

1 

0.00000 

0.00000 

-0.04158 

-0.25182 

0.00000 

0.00067 

2 

0.06810 

0.00001 

0.00000 

0.00000 

0.00409 

0.00000 

3 

0.00000 

-0.03807 

0.00000 

0.00000 

0.00000 

0.00000 

4 

0.00000 

0.00000 

-0.03819 

0.32052 

0.00000 

-0.06298 

5 

-0.13631 

-0.00002 

0.00000 

0.00000 

0.08457 

0.00000 

6 

-0.00002 

0.15172 

0.00000 

0.00000 

0.00000 

0.00000 

7 

-0.53308 

-0.00005 

0.55756 

0.08893 

-0.39806 

0.36994 

8 

0.27166 

0.00003 

-0.38524 

0.15510 

-0.53641 

0.57206 

9 

0.00007 

-0.60187 

0.00001 

0.00000 

0.00000 

0.00000 

10 

0.53307 

0.00006 

0.55757 

0.08893 

0.39803 

0.36997 

11 

0.27165 

0.00003 

0.38524 

-0.15509 

-0.53637 

-0.57209 

12 

0.00007 

-0.60187 

0.00001 

0.00000 

0.00000 

0.00000 

ROOT  NO. 

7 

8 

9 

10 

11 

12 

-0.00019 

-0.00044 

-0.00016 

3.38368 

2.03661 

-0.76725 

1 

0.25401 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

2 

0.00000 

-0.25401 

0.00000 

0.00000 

0.00000 

-0.17792 

3 

0.00000 

0.00000 

-0.25401 

0.00000 

-0.19832 

0.00000 

4 

0.25401 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

5 

0.00000 

-0.25401 

0.00000 

0.00000 

0.00000 

0.17731 

6 

0.00000 

0.00000 

-0.25401 

0.00000 

0.19764 

0.00000 

7 

0.25401 

0.00000 

0.00000 

0.00000 

0.00000 

-0.26930 

8 

0.00000 

-0.25401 

0.00000 

0.00000 

0.00000 

0.35565 

9 

0.00000 

0.00000 

-0.25401 

0.70572 

0.39642 

0.00000 

10 

0.25401 

0.00000 

0.00000 

0.00000 

0.00000 

0.26930 

11 

0.00000 

-0.25401 

0.00000 

0.00000 

0.00000 

0.35565 

12 

0.00000 

0.00000 

-0.25401 

-0.70572 

0.39642 

0.00000 
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MASS-WEIGHTED  COORDINATE  ANALYSIS 


Note  6 


ROOT  NO. 

1 

2 

3 

4 

5 

6 

1209.90331 

1214.67040 

1490.52685 

2114.53841 

3255.93651 

3302.12319 

1 

0.00000 

0.00000 

-0.16877 

-0.66231 

0.00000 

0.00271 

2 

0.26985 

0.00003 

0.00000 

0.00000 

0.01649 

0.00000 

3 

0.00002 

-0.15005 

0.00000 

0.00000 

0.00000 

0.00000 

4 

0.00000 

0.00000 

-0.13432 

0.73040 

0.00001 

-0.22013 

5 

-0.46798 

-0.00005 

0.00000 

0.00000 

0.29524 

0.00001 

6 

-0.00006 

0.51814 

0.00000 

0.00000 

0.00000 

0.00000 

7 

-0.53018 

-0.00005 

0.56805 

0.05871 

-0.40255 

0.37455 

8 

0.27018 

0.00003 

-0.39249 

0.10238 

-0.54246 

0.57918 

9 

0.00007 

-0.59541 

0.00001 

0.00000 

0.00000 

0.00000 

10 

0.53018 

0.00006 

0.56806 

0.05871 

0.40252 

0.37457 

11 

0.27018 

0.00003 

0.39249 

-0.10238 

-0.54242 

-0.57922 

12 

0.00007 

-0.59541 

0.00001 

0.00000 

0.00000 

0.00000 

ROOT  NO. 

7 

8 

9 

10 

11 

Note  7 

12 

-0.00025 

-0.00022 

-0.00047 

3.38368 

2.03661 

-0.76725 

1 

0.72996 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

2 

0.00000 

-0.72996 

0.00000 

0.00000 

0.00000 

-0.62774 

3 

0.00000 

0.00000 

-0.72996 

0.00000 

-0.66681 

0.00000 

4 

0.63247 

0.00000 

0.00000 

0.00000 

0.00000 

0.00000 

5 

0.00000 

-0.63247 

0.00000 

0.00000 

0.00000 

0.54204 

6 

0.00000 

0.00000 

-0.63247 

0.00000 

0.57578 

0.00000 

7 

0.18321 

0.00000 

0.00000 

0.00000 

0.00000 

-0.23848 

8 

0.00000 

-0.18321 

0.00000 

0.00000 

0.00000 

0.31495 

9 

0.00000 

0.00000 

-0.18321 

0.70711 

0.33455 

0.00000 

10 

0.18321 

0.00000 

0.00000 

0.00000 

0.00000 

0.23848 

11 

0.00000 

-0.18321 

0.00000 

0.00000 

0.00000 

0.31495 

12 

0.00000 

0.00000 

-0.18321 

-0.70711 

0.33455 

0.00000 
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DESCRIPTION  OF  VIBRATIONS 


VIBRATION 

1 

ATOM  PAIR 

ENERGY 

CONTRIBUTION 

RADIAL 

FREQ. 

1209.90 

C  2  —  H  3 

42.7%  (  79.4%) 

12.6% 

T-DIPOLE 

0.8545 

C  2  —  H  4 

42.7% 

12.6% 

TRAVEL 

0.1199 

O  1  —  C  2 

14.6% 

0.0% 

RED.  MASS 

1.9377 

VIBRATION 

2 

ATOM  PAIR 

ENERGY 

CONTRIBUTION 

RADIAL 

FREQ. 

1214.67 

C  2  —  H  3 

45.1%  (  62.3%) 

0.0% 

T-DIPOLE 

0.1275 

C  2  —  H  4 

45.1% 

0.0% 

TRAVEL 

0.1360 

O  1  —  C  2 

9.8% 

0.0% 

RED.  MASS 

1.5004 

VIBRATION 

3 

ATOM  PAIR 

ENERGY 

CONTRIBUTION 

RADIAL 

FREQ. 

1490.53 

C  2  —  H  4 

49.6%  (  61.5%) 

0.6% 

T-DIPOLE 

0.3445 

C  2  —  H  3 

49.6% 

0.6% 

TRAVEL 

0.1846 

O  1  —  C  2 

0.9% 

100.0% 

RED.  MASS 

0.6639 

VIBRATION 

4 

ATOM  PAIR 

ENERGY 

CONTRIBUTION 

RADIAL 

FREQ. 

2114.54 

O  1  —  C  2 

60.1%  (100.5%) 

100.0% 

T-DIPOLE 

3.3662 

C  2  —  H  4 

20.0% 

17.7% 

TRAVEL 

0.0484 

C  2  —  H  3 

20.0% 

17.7% 

RED.  MASS 

6.7922 

VIBRATION 

5 

ATOM  PAIR 

ENERGY 

CONTRIBUTION 

RADIAL 

FREQ. 

3255.94 

C  2  —  H  3 

49.5%  (  72.2%) 

98.1% 

T-DIPOLE 

0.7829 

C  2  —  H  4 

49.5% 

98.1% 

TRAVEL 

0.1174 

O  1  —  C  2 

1.0% 

0.0% 

RED.  MASS 

0.7508 

VIBRATION 

6 

ATOM  PAIR 

ENERGY 

CONTRIBUTION 

RADIAL 

FREQ. 

3302.12 

C  2  —  H  4 

49.3%  (  69.8%) 

95.5% 

T-DIPOLE 

0.3478 

C  2  —  H  3 

49.3% 

95.5% 

TRAVEL 

0.1240 

O  1  —  C  2 

1.4% 

100.0% 

RED.  MASS 

0.6644 

SYSTEM  IS  A  GROUND  STATE 


FORMALDEHYDE,  MNDO  ENERGY  =  -32.8819  See  Manual. 
DEMONSTRATION  OF  MOPAC  -  FORCE  AND  THERMODYNAMICS  CALCULATION 


MOLECULE  IS  NOT  LINEAR 

THERE  ARE  6  GENUINE  VIBRATIONS  IN  THIS  SYSTEM 
THIS  THERMODYNAMICS  CALCULATION  IS  LIMITED  TO 
MOLECULES  WHICH  HAVE  NO  INTERNAL  ROTATIONS 
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TEMP.  (K) 


CALCULATED  THERMODYNAMIC  PROPERTIES 

* 


Note  8 


PARTITION  FUNCTION  H.O.F. 

KCAL/MOL 


ENTHALPY  HEAT  CAPACITY  ENTROPY 
CAL/MOLE  CAL/K/MOL  CAL/K/MOL 


298  VIB. 
ROT. 
INT. 
TRA. 
TOT. 


1.007 

709. 

714. 

0 . 159E+27 


23.39484 

888.305 

911.700 

1480.509 

-32.882  2392.2088 


0.47839 

2.981 

3.459 

4.968 

8.4274 


0.09151 

16.026 

16.117 

36.113 

52.2300 


*  NOTE:  HEATS  OF  FORMATION  ARE  RELATIVE  TO  THE 

ELEMENTS  IN  THEIR  STANDARD  STATE  AT  298K 


TOTAL  CPU  TIME:  32.26  SECONDS 
==  MOPAC  DONE  == 

NOTE  1:  All  three  words,  ROT,  FORCE,  and  THERMO  are  necessary  in 
order  to  obtain  thermodynamic  properties.  In  order  to  obtain  results 
for  only  one  temperature,  THERMO  has  the  first  and  second  arguments 
identical.  The  symmetry  number  for  the  C2v  point-group  is  2. 

NOTE  2:  Internal  coordinate  derivatives  are  in  Kcal/Angstrom  or 
Kcal/radian.  Values  of  less  than  about  0.2  are  quite  acceptable. 

NOTE  3:  In  larger  calculations,  the  time  estimates  are  useful.  In 
practice  they  are  pessimistic,  and  only  about  70%  of  the  time  estimated 
will  be  used,  usually.  The  principal  moments  of  inertia  can  be  directly 
related  to  the  microwave  spectrum  of  the  molecule.  They  are  simple 
functions  of  the  geometry  of  the  system,  and  are  usually  predicted  with 
very  high  accuracy. 

NOTE  4:  Zero  point  energy  is  already  factored  into  the  MNDO 
parameterization.  Force  constant  data  are  not  printed  by  default.  If 
you  want  this  output,  specify  LARGE  in  the  keywords. 

NOTE  5:  Normal  coordinate  analysis  has  been  extensively  changed. 
The  first  set  of  eigenvectors  represent  the  'normalized*  motions  of  the 
atoms.  The  sum  of  the  speeds  (not  the  velocities)  of  the  atoms  adds  to 
unity.  This  is  verified  by  looking  at  the  motion  in  the  'z'  direction 
of  the  atoms  in  vibration  2.  Simple  addition  of  these  terms,  unsigned, 
adds  to  1.0,  whereas  to  get  the  same  result  for  mode  1  the  scalar  of  the 
motion  of  each  atom  needs  to  be  calculated  first. 

Users  might  be  concerned  about  reproducibility.  As  can  be  seen 
from  the  vibrational  frequencies  from  Version  3.00  to  6.00  given  below, 
the  main  difference  over  earlier  FORCE  calculations  is  in  the  trivial 
frequencies. 
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Real  Frequencies  of  Formaldehyde 


Version 

3.00 

1209.96 

1214.96 

1490.60 

2114.57 

3255.36 

3301.57 

Version 

3.10 

1209.99 

1215.04 

1490.59 

2114.57 

3255.36 

3301.58 

Version 

4.00 

1209.88 

1214.67 

1490.52 

2114.52 

3255.92 

3302.10 

Version 

5.00 

1209.89 

1214.69 

1490.53 

2114.53 

3255.93 

3302.10 

Version 

6.00 

1209.90 

1214.67 

1490.53 

2114.54 

3255.94 

3302.12 

Trivial  Frequencies  of 

:  Formaldehyde 

T(x) 

T(y) 

T(z) 

R(x) 

R(y) 

R(z) 

Version 

3.00 

-0.00517 

-0.00054 

-0.00285 

57.31498 

11.59518 

9.01619 

Version 

3.10 

-0.00557 

0.00049 

-0.00194 

87.02506 

11.18157 

10.65295 

Version 

4.00 

-0.00044 

-0.00052 

-0.00041 

12.99014 

-3.08110 

-3.15427 

Version 

5.00 

0.00040 

-0.00044 

-0.00062 

21.05654 

2.80744 

3.83712 

Version 

6.00 

-0.00025 

-0.00022 

-0.00047 

3.38368 

2.03661 

-0.76725 

NOTE  6:  Normal  modes  are  not  of  much  use  in  assigning  relative 
importance  to  atoms  in  a  mode.  Thus  in  iodomethane  it  is  not  obvious 
from  an  examination  of  the  normal  modes  which  mode  represents  the  C-I 
stretch.  A  more  useful  description  is  provided  by  the  energy  or 
mass-weighted  coordinate  analysis.  Each  set  of  three  coefficients  now 
represents  the  relative  energy  carried  by  an  atom.  (This  is  not 
strictly  accurate  as  a  definition,  but  is  believed  (by  JJPS)  to  be  more 
useful  than  the  stricter  definition.) 

NOTE  7s  The  following  description  of  the  coordinate  analysis  is 
given  without  rigorous  justification.  Again,  the  analysis,  although 
difficult  to  understand,  has  been  found  to  be  more  useful  than  previous 
descriptions. 

On  the  left-hand  side  are  printed  the  frequencies  and  transition 
dipoles.  Underneath  these  are  the  reduced  masses  and  idealized 
distances  traveled  which  represent  the  simple  harmonic  motion  of  the 
vibration.  The  mass  is  assumed  to  be  attached  by  a  spring  to  an 
infinite  mass.  Its  displacement  is  the  travel. 

The  next  column  is  a  list  of  all  pairs  of  atoms  that  contribute 
significantly  to  the  energy  of  the  mode.  Across  from  each  pair  (next 
column)  is  the  percentage  energy  contribution  of  the  pair  to  the  mode, 
calculated  according  to  the  formula  described  below. 

FORMULA  FOR  ENERGY  CONTRIBUTION 

The  total  vibrational  energy,  T,  carried  by  all  pairs  of  bonded 
atoms  in  a  molecule  is  first  calculated.  For  any  given  pair  of  atoms,  A 
and  B,  the  relative  contribution,  R.C.(A,B),  as  a  percentage,  is  given 
by  the  energy  of  the  pair,  P(A,B),  times  100  divided  by  T,  i.e., 

R.C. (A,B)  =  100P( A,B)/T 


As  an  example,  for  formaldehyde  the  energy  carried  by  the  pair  of 
atoms  (C,0)  is  added  to  the  energy  of  the  two  (C,H)  pairs  to  give  a 
total,  T.  Note  that  this  total  cannot  be  related  to  anything  which  is 
physically  meaningful  (there  is  obvious  double-counting),  but  it  is  a 
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convenient  artifice.  For  mode  4,  the  C=0  stretch,  the  relative 
contribution  of  the  carbon-oxygen  pair  is  60.1%.  It  might  be  expected 
to  be  about  100%  (after  all,  we  envision  the  C-0  bond  as  absorbing  the 
photon);  however,  the  fact  that  the  carbon  atom  is  vibrating  implies 
that  it  is  changing  its  position  relative  to  the  two  hydrogen  atoms.  If 
the  total  vibrational  energy,  Ev  (the  actual  energy  of  the  absorbed 
photon,  as  distinct  from  T),  were  carried  equally  by  the  carbon  and 
oxygen  atoms,  then  the  relative  contributions  to  the  mode  would  be  C=0, 
50%  ;  C-H,  25%  ;  C-H,  25%,  respectively.  This  leads  to  the  next  entry, 
which  is  given  in  parentheses. 

For  the  pair  with  the  highest  relative  contribution  (in  mode  4,  the 
C=0  stretch),  the  energy  of  that  pair  divided  by  the  total  energy  of  the 
mode,  Ev,  is  calculated  as  a  percentage.  This  is  the  absolute 
contribution,  A.C.  as  a  percentage,  to  the  total  energy  of  the  mode. 


A.C. (A,B)  =  100P(A,B)/Ev 


Now  the  C=0  is  seen  to  contribute  100.5  percent  of  the  energy.  For  this 
sort  of  partitioning  only  the  sum  of  all  A.C.'s  must  add  to  100%,  each 
pair  can  contribute  more  or  less  than  100%.  In  the  case  of  a  free 
rotator,  e.g.  ethane,  the  A.C.  of  any  specific  bonded  pair  to  the 
total  energy  can  be  very  high  (several  hundred  percent). 

It  may  be  easier  to  view  P/Ev  as  a  contribution  to  the  total  energy 
of  the  mode,  Ev.  In  this  case  the  fact  that  P/Ev  can  be  greater  than 
unity  can  be  explained  by  the  fact  that  there  are  other  relative  motions 
within  the  molecule  which  make  a  negative  contribution  to  Ev. 

From  the  R.C.'s  an  idea  can  be  obtained  of  where  the  energy  of  the 
mode  is  going;  from  the  A.C.  value  the  significance  of  the  highest 
contribution  can  be  inferred.  Thus,  in  mode  4  all  three  bonds  are 
excited,  but  because  the  C-0  bond  carries  about  100%  of  the  energy,  it 
is  clear  that  this  is  really  a  C=0  bond  stretch  mode,  and  that  the 
hydrogens  are  only  going  along  for  the  ride. 

In  the  last  column  the  percentage  radial  motion  is  printed.  This 
is  useful  in  assigning  the  mode  as  stretching  or  bending.  Any 
non-radial  motion  is  de-facto  tangential  or  bending. 

To  summarize:  The  new  analysis  is  more  difficult  to  understand, 
but  is  considered  by  the  author  (JJPS)  to  be  the  easiest  way  of 
describing  what  are  often  complicated  vibrations. 

NOTE  8:  In  order,  the  thermodynamic  quantities  calculated  are: 

(1)  The  vibrational  contribution, 

(2)  The  rotational  contribution, 

(3)  The  sum  of  (1)  and  (2),  this  gives  the  internal  contribution. 
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(4)  The  translational  contribution. 

For  partition  functions  the  various  contributions  are  multiplied 
together . 

A  new  quantity  is  the  heat  of  formation  at  the  defined  temperature. 
This  is  intended  for  use  in  calculating  heats  of  reaction.  Because  of  a 
limitation  in  the  data  available,  the  H.o.F.  at  T  Kelvin  is  defined  as 
"The  heat  of  formation  of  the  compound  at  T  Kelvin  from  it's  elements  in 
their  standard  state  at  298  Kelvin".  Obviously,  this  definition  of  heat 
of  formation  is  incorrect,  but  should  be  useful  in  calculating  heats  of 
reaction,  where  the  elements  in  their  standard  state  at  298  Kelvin  drop 
out . 


5.3  EXAMPLE  OF  REACTION  PATH  WITH  SYMMETRY 

In  this  example,  one  methyl  group  in  ethane  is  rotated  relative  to 
the  other  and  the  geometry  is  optimized  at  each  point.  As  the  reaction 
coordinate  involves  three  hydrogen  atoms  moving,  symmetry  is  imposed  to 


ensure 

equivalence 

of  all  hydrogens. 

Line 

1 

SYMMETRY 

T=600 

Line 

2 

ROTATION  OF  METHYL  GROUP  IN 

ETHANE 

Line 

3 

EXAMPLE  OF  A  REACTION  PATH  CALCULATION 

Line 

4 

C 

Line 

5 

C 

1.479146 

1 

Line 

6 

H 

1.109475 

1 

111.328433 

1 

Line 

7 

H 

1.109470 

0 

111.753160 

0 

120.000000 

0 

2 

1 

3 

Line 

8 

H 

1.109843 

0 

110.103163 

0 

240.000000 

0 

2 

1 

3 

Line 

9 

H 

1.082055 

0 

121.214083 

0 

60.000000 

-1 

1 

2 

3 

Line 

10 

H 

1.081797 

0 

121.521232 

0 

180.000000 

0 

1 

2 

3 

Line 

11 

H 

1.081797 

0 

121.521232 

0 

-60.000000 

0 

1 

2 

3 

Line 

12 

0 

0.000000 

0 

0.000000 

0 

0.000000 

0 

0 

0 

0 
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Points  to  note: 

(1)  The  dihedrals  of  the  second  and  third  hydrogens  are  not  marked 
for  optimization:  the  dihedrals  follow  from  point-group  symmetry. 

(2)  All  six  C-H  bond  lengths  and  H-C-C  angles  are  related  by 
symmetry:  see  lines  13  and  14. 

(3)  The  dihedral  on  line  9  is  the  reaction  coordinate,  while  the 
dihedrals  on  lines  10  and  11  are  related  to  it  by  symmetry  functions  on 
lines  15  and  16.  The  symmetry  functions  are  defined  by  the  second 
number  on  lines  13  to  16  (see  SYMMETRY  for  definitions  of  functions  1, 
2,  7,  and  11). 
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(4)  Symmetry  data  are  ended  by  a  blank  line. 

(5)  The  reaction  coordinate  data  are  ended  by  the  end  of  file. 
Several  lines  of  data  are  allowed. 

(6)  Whenever  symmetry  is  used  in  addition  to  other  data  below  the 
geometry  definition  it  will  always  follow  the  "blank  line"  immediately 
following  the  geometry  definition.  The  other  data  will  always  follow 
the  symmetry  data. 


CHAPTER  6 


BACKGROUND 


6 . 1  INTRODUCTION 

While  all  the  theory  used  in  MOPAC  is  in  the  literature,  so  that  in 
principle  one  could  read  and  understand  the  algorithm,  many  parts  of  the 
code  involve  programming  concepts  or  constructions  which,  while  not  of 
sufficient  importance  to  warrant  publication,  are  described  here  in 
order  to  facilitate  understanding. 


6 . 2  AIDER 

AIDER  will  allow  gradients  to  be  defined  for  a  system.  MOPAC  will 
calculate  gradients,  as  usual,  and  will  then  use  the  supplied  gradients 
to  form  an  error  function.  This  error  function  is:  (supplied  gradients 
-  initial  calculated  gradients),  which  is  then  added  to  the  computed 
gradients,  so  that  for  the  initial  SCF,  the  apparent  gradients  will 
equal  the  supplied  gradients. 

A  typical  data-set  using  AIDER  would  look  like  this: 

PM3  AIDER  AIGOUT  GNORM=0.01  EF 
Cyclohexane 

X 


X 

1 

1.0 

c 

1 

cx 

2 

CXX 

c 

1 

CX 

2 

CXX 

3 

120.000000 

c 

1 

CX 

2 

CXX 

3 

-120.000000 

X 

1 

1.0 

2 

90.0 

3 

0.000000 

X 

1 

1.0 

6 

90.0 

2 

180.000000 

c 

1 

CX 

7 

CXX 

3 

180.000000 

c 

1 

CX 

7 

CXX 

3 

60.000000 

c 

1 

CX 

7 

CXX 

3 

-60.000000 

H 

3 

H1C 

1 

H1CX 

2 

0.000000 

H 

4 

H1C 

1 

H1CX 

2 

0.000000 

H 

5 

H1C 

1 

H1CX 

2 

0.000000 

H 

8 

H1C 

1 

H1CX 

2 

180.000000 

H 

9 

H1C 

1 

H1CX 

2 

180.000000 

H 

10 

H1C 

1 

H1CX 

2 

180.000000 

H 

3 

H2C 

1 

H2CX 

2 

180.000000 
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H  4 

H2C  1 

H2CX  2 

180.000000 

H  5 

H2C  1 

H2CX  2 

180.000000 

H  8 

H2C  1 

H2CX  2 

0.000000 

H  9 

H2C  1 

H2CX  2 

0.000000 

H  10 

H2C  1 

H2CX  2 

0.000000 

CX  1 

.46613 

H1C  1 

.10826 

H2C  1 

.10684 

CXX  80 

.83255 

H1CX  103 

.17316 

H2CX  150 

.96100 

AIDER 

0.0000 

13.7589 

-1.7383 

13.7589 

-1.7383 

0.0000 

13.7589 

-1.7383 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

13.7589 

-1.7383 

0.0000 

13.7589 

-1.7383 

0.0000 

13.7589 

-1.7383 

0.0000 

-17.8599 

-2.1083 

0.0000 

-17.8599 

-2.1083 

0.0000 

-17.8599 

-2.1083 

0.0000 

-17.8599 

-2.1083 

Q.0000 

-17.8599 

-2.1083 

0.0000 

-17.8599 

-2.1083 

0.0000 

-17.5612 

-0.6001 

0.0000 

-17.5612 

-0.6001 

0.0000 

-17.5612 

-0.6001 

0.0000 

-17.5612 

-0.6001 

0.0000 

-17.5612 

-0.6001 

0.0000 

-17.5612 

-0.6001 

0.0000 

Each  supplied  gradient  goes  with  the  corresponding  internal 
coordinate.  In  the  example  given,  the  gradients  came  from  a  3-21G 
calculation  on  the  geometry  shown.  Symmetry  will  be  taken  into  account 
automatically.  Gaussian  prints  out  gradients  in  atomic  units;  these 
need  to  be  converted  into  kcal/mol/Angstrom  or  kcal/mol/radian  for  MOPAC 
to  use.  The  resulting  geometry  from  the  MOPAC  run  will  be  nearer  to  the 
optimized  3-21G  geometry  than  if  the  normal  geometry  optimizers  in 
Gaussian  had  been  used. 


6.3  CORRECTION  TO  THE  PEPTIDE  LINKAGE 

The  residues  in  peptides  are  joined  together  by  peptide  linkages, 
-HNCO-.  These  linkages  are  almost  flat,  and  normally  adopt  a  trans 
configuration;  the  hydrogen  and  oxygen  atoms  being  on  opposite  sides  of 
the  C-N  bond.  Experimentally,  the  barrier  to  interconversion  in 
N-methyl  acetamide  is  about  14  Kcal/mole,  but  all  four  methods  within 
MOPAC  predict  a  significantly  lower  barrier,  PM3  giving  the  lowest 
value . 
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The  low  barrier  can  be  traced  to  the  tendency  of  semiempirical 
methods  to  give  pyramidal  nitrogens.  The  degree  to  which 
pyramidalization  of  the  nitrogen  atom  is  preferred  can  be  seen  in  the 
following  series  of  compounds. 


Compound 

MINDO/3 

MNDO 

AMI 

PM3 

Exp 

Ammonia 

Py 

Py 

Py 

Py 

Py 

Aniline 

Py 

Py 

Py 

Py 

Py 

Formamide 

Py 

Py 

Flat 

Py 

Py 

Acetamide 

Flat 

Py 

Flat 

Py 

Flat 

N-methyl  formamide 

Flat 

Py 

Flat 

Py 

Flat 

N-methyl  acetamide 

Flat 

Flat 

Flat 

Py 

Flat 

To  correct  this,  a  molecular-mechanics  correction  has  been  applied. 
This  consists  of  identifying  the  -R-HNCO-  unit,  and  adding  a  torsion 
potential  of  form 


2 

Cons t*Sin( Theta) 


where  Theta  is  the  X-N-C-0  angle,  X=R  or  H,  and  Const  varies  from  method 
to  method.  This  has  two  effects:  there  is  a  force  constraining  the 
nitrogen  to  be  planar,  and  HNCO  barrier  in  N-methyl  acetamide  is  raised 
to  14.00  Kcal/mole.  When  the  MM  correction  is  in  place,  the  nitrogen 
atom  for  all  methods  for  the  last  three  compounds  shown  above  is  planar. 
The  correction  should  be  user-transparent 

Cautions 

1.  This  correction  will  lead  to  errors  of  0.5  -  1.5  Kcal/mole  if 
the  peptide  linkage  is  made  or  broken  in  a  reaction 
calculation. 

2.  If  the  correction  is  applied  to  formamide  the  nitrogen  will  be 
flat,  contrary  to  experiment. 

3.  When  calculating  rotation  barriers,  take  into  account  the  rapid 
rehybridization  which  occurs.  When  the  dihedral  is  0  or  180 
degrees  the  nitrogen  will  be  planar  (sp2),  but  at  90  degrees 
the  nitrogen  should  be  pyramidal,  as  the  partial  double  bond  is 
broken.  At  that  geometry  the  true  transition  state  involves 
motion  of  the  nitrogen  substituent  so  that  the  nitrogen  in  the 
transition  state  is  more  nearly  sp2.  In  other  words,  a  simple 
rotation  of  the  HNCO  dihedral  will  not  yield  the  activation 
barrier,  however  it  will  be  within  2  Kcal/mole  of  the  correct 
answer.  The  14  Kcal  barrier  mentioned  earlier  refers  to  the 
true  transition  state. 

4.  Any  job  involving  a  CONH  group  will  require  either  the  keyword 
NOMM  or  MMOK.  If  you  do  not  want  the  correction  to  be  applied, 
use  the  keyword  "NOMM"  (NO  Molecular  Mechanics). 
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6.4  LEVEL  OF  PRECISION  WITHIN  MOPAC 

Several  users  have  criticised  the  tolerances  within  MOPAC.  The 
point  made  is  that  significantly  different  results  have  been  obtained 
when  different  starting  conditions  have  been  used,  even  when  the  same 
conformer  should  have  resulted.  Of  course,  different  results  must  be 
expected  —  there  will  always  be  small  differences  —  nonetheless  any 
differences  should  be  small,  e.g.  heats  of  formation  (H.o.F.) 
differences  should  be  less  than  about  0.1  kcal/mole.  MOPAC  has  been 
modified  to  allow  users  to  specify  a  much  higher  precision  than  the 
default  when  circumstances  warrant  it. 

Reasons  for  low  precision 

There  are  several  reasons  for  obtaining  low  quality  results.  The 
most  obvious  cause  of  such  errors  is  that  for  general  work  the  default 
criteria  will  result  in  a  difference  in  H.o.F.  of  less  than  0.1 
Kcal/mole.  This  is  only  true  for  fairly  rigid  systems,  e.g. 
formaldehyde  and  benzene.  For  systems  with  low  barriers  to  rotation  or 
flat  potential  surfaces,  e.g.  aniline  or  water  dimer,  quite  large 
H.o.F.  errors  can  result. 

Various  Precision  Levels 

In  normal  (non-publication  quality)  work  the  default  precision  of 
MOPAC  is  recommended.  This  will  allow  reasonably  precise  results  to  be 
obtained  in  a  reasonable  time.  Unless  this  precision  proves 
unsatisfactory,  use  this  default  for  all  routine  work. 

The  best  way  of  controlling  the  precision  of  the  geometry 
optimization  and  gradient  minimization  is  by  specifying  a  gradient  norm 
which  must  be  satisfied.  This  is  done  via  the  keyword  GNORM= .  Altering 
the  GNORM  automatically  disables  the  other  termination  tests  resulting 
in  the  gradient  norm  dominating  the  calculation.  This  works  both  ways: 
a  GNORM  of  20  will  give  a  very  crude  optimization  while  a  GNORM  of  0.01 
will  give  a  very  precise  optimization.  The  default  GNORM  is  1.0. 

When  the  highest  precision  is  needed,  such  as  in  exacting  geometry 
work,  or  when  you  want  results  which  cannot  be  improved,  then  use  the 
combination  keywords  GNORM=0.0  and  SCFCRT=1.D-NN;  NN  should  be  in  the 
range  2-15.  Increasing  the  SCF  criterion  (the  default  is  SCFCRT=l.D-4) 
helps  the  line  search  routines  by  increasing  the  precision  of  the  heat 
of  formation  calculation;  however,  it  can  lead  to  excessive  run  times, 
so  take  care.  Also,  there  is  an  increased  chance  of  not  achieving  an 
SCF  when  the  SCF  criterion  is  excessively  increased. 

Superficially,  requesting  a  GNORM  of  zero  might  seem  excessively 
stringent,  but  as  soon  as  the  run  starts,  it  will  be  cut  back  to  0.01. 
Even  that  might  seem  too  stringent.  The  geometry  optimization  will 
continue  to  lower  the  energy,  and  hopefully  the  GNORM,  but  frequently  it 
will  not  prove  possible  to  lower  the  GNORM  to  0.01.  If,  after  10 
cycles,  the  energy  does  not  drop  then  the  job  will  be  stopped.  At  this 
point  you  have  the  best  geometry  that  MOPAC,  in  its  current  form,  can 
give. 
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If  a  slightly  less  than  highest  precision  is  needed,  such  as  for 
normal  publication  quality  work,  set  the  GNORM  to  the  limit  wanted.  For 
example,  for  a  flexible  system,  a  GNORM  of  0.1  to  0.5  will  normally  be 
good  enough  for  all  but  the  most  demanding  work. 

If  higher  than  the  default,  but  still  not  very  high  precision  is 
wanted,  then  use  the  keyword  PRECISE.  This  will  tighten  up  various 
criteria  so  that  higher  than  routine  precision  will  be  given. 

If  high  precision  is  used,  so  that  the  printed  GNORM  is  0.000,  and 
the  resulting  geometry  resubmitted  for  one  SCF  and  gradients 
calculation,  then  normally  a  GNORM  higher  than  0.000  will  result.  This 
is  NOT  an  error  in  MOPAC:  the  geometry  printed  is  only  precise  to  six 
figures  after  the  decimal  point.  Geometries  need  to  be  specified  to 
more  than  six  decimals  in  order  to  drive  the  GNORM  to  less  than  0.000. 

If  you  want  to  test  MOPAC,  or  use  it  for  teaching  purposes,  the 
GNORM  lower  limit  of  0.01  can  be  overridden  by  specifying  LET,  in  which 
case  you  can  specify  any  limit  for  GNORM.  However,  if  it  is  too  low  the 
job  may  finish  due  to  an  irreducible  minimum  in  the  heat  of  formation 
being  encountered.  If  this  happens,  the  "STATIONARY  POINT"  message  will 
be  printed. 

Finally  there  is  a  full  analytical  derivative  function  within 
MOPAC.  These  use  STO-6G  Gaussian  wavefunctions  because  the  derivatives 
of  the  overlap  integral  are  easier  to  calculate  in  Gaussians  than  in 
STO's.  Consequently,  there  will  be  a  small  difference  in  the  calculated 
H.o.F.s  when  analytical  derivatives  are  used.  If  there  is  any  doubt 
about  the  accuracy  of  the  finite  derivatives,  try  using  the  analytical 
derivatives.  They  are  a  bit  slower  than  finite  derivatives  but  are  more 
precise  (a  rough  estimate  is  12  figures  for  finite  difference,  14  for 
analytical) . 

Some  calculations,  mainly  open  shell  RHF  or  closed  shell  RHF  with 
C.I.  have  untracked  errors  which  prevent  very  high  precision.  For  these 
systems  GNORM  should  be  in  the  range  1.0  to  0.1. 

How  Large  can  a  Gradient  Be  and 
Still  Be  Acceptable? 

A  common  source  of  confusion  is  the  limit  to  which  the  GNORM  should 
be  reduced  in  order  to  obtain  acceptable  results.  There  is  no  easy 
answer,  however  a  few  guidelines  can  be  given. 

First  of  all  reducing  the  GNORM  to  an  arbitarily  small  number  is 
not  sensible.  If  the  keywords  GNORM=0 . 000001 ,  LET,  and  EF  are  used,  a 
geometry  con  be  obtained  which  is  precise  to  about  0.000001  Angstroms. 
If  ANALYT  is  also  used,  the  results  obtained  will  be  slightly  different. 
Chemically,  this  change  is  meaningless,  and  no  significance  should  be 
attached  to  such  numbers.  In  addition,  any  minor  change  to  the 
algorithm,  such  as  porting  it  to  a  new  machine,  will  give  rise  to  small 
changes  in  the  optimized  geometry.  Even  the  small  changes  involved  in 
going  from  MOPAC  5.00  to  MOPAC  6.00  caused  small  changes  in  the 
optimized  geometry  of  test  molecules. 
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As  a  guide,  a  GNORM  of  0.1  is  sufficient  for  all  heat-of-formation 
work/  and  a  GNORM  of  0.01  for  most  geometry  work.  If  the  system  is 
large/  you  may  need  to  settle  for  a  GNORM  of  1.0  -  0.5. 

This  whole  topic  was  raised  by  Dr.  Donald  B.  Boyd  of  Lilly 
Research  Laboratories/  who  provided  unequivocal  evidence  for  a  failure 
of  MOPAC  and  convinced  me  of  the  importance  of  increasing  precision  in 
certain  circumstances. 


6.5  CONVERGENCE  TESTS  IN  SUBROUTINE  ITER 

Self-Consistency  Test 

The  SCF  iterations  are  stopped  when  two  tests  are  satisfied.  These 
are  (1)  when  the  difference  in  electronic  energy,  in  eV,  between  any  two 
consecutive  iterations  drops  below  the  adjustable  parameter,  SELCON,  and 
the  difference  between  any  three  consecutive  iterations  drops  below  ten 
times  SELCON,  and  (2)  the  difference  in  density  matrix  elements  on  two 
successive  iterations  falls  below  a  preset  limit,  which  is  a  multiple  of 
SELCON. 

SELCON  is  set  initially  to  0.0001  kcal/mole;  this  can  be  made  100 
times  smaller  by  specifying  PRECISE  or  FORCE.  It  can  be  over-ridden  by 
explicitly  defining  the  SCF  criterion  via  SCFCRT=1.D-12. 

SELCON  is  further  modified  by  the  value  of  the  gradient  norm,  if 
known.  If  GNORM  is  large,  then  a  more  lax  SCF  criterion  is  acceptable, 
and  SCFCRT  can  be  relaxed  up  to  50  times  it's  default  value.  As  the 
gradient  norm  drops,  the  SCF  criterion  returns  to  its  default  value. 

The  SCF  test  is  performed  using  the  energy  calculated  from  the  Fock 
matrix  which  arises  from  a  density  matrix,  and  not  from  the  density 
matrix  which  arises  from  a  Fock.  In  the  limit,  the  two  energies  would 
be  identical,  but  the  first  converges  faster  than  the  second,  without 
loss  of  precision. 


6.6  CONVERGENCE  IN  SCF  CALCULATION 

A  brief  description  of  the  convergence  techniques  used  in 
subroutine  ITER  follows. 

ITER,  the  SCF  calculation,  employs  six  methods  to  achieve  a 
self-consistent  field.  In  order  of  usage,  these  are: 

(a)  Intrinsic  convergence  by  virtue  of  the  way  the  calculation  is 
carried  out.  Thus  a  trial  Fock  gives  rise  to  a  trial  density  matrix, 
which  in  turn  is  used  to  generate  a  better  Fock  matrix. 

This  is  normally  convergent,  but  many  exceptions  are  known.  The 
main  situations  when  the  intrinsic  convergence  does  not  work  are: 
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(1)  A  bad  starting  density  matrix.  This  normally  occurs 
when  the  default  starting  density  matrix  is  used.  This  is  a  very 
crude  approximation,  and  is  only  used  to  get  the  calculation 
started.  A  large  charge  is  generated  on  an  atom  in  the  first 
iteration,  the  second  iteration  overcompensates,  and  an 
oscillation  is  generated. 

(2)  The  equations  are  only  very  slowly  convergent.  This  can 
be  due  to  a  long-lived  oscillation  or  to  a  slow  transfer  of 
charge. 

(b)  Oscillation  damping.  If,  on  any  two  consecutive  iterations,  a 
density  matrix  element  changes  by  more  than  0.05,  then  the  density 
matrix  element  is  set  equal  to  the  old  element  shifted  by  0.05  in  the 
direction  of  the  calculated  element.  Thus,  if  on  iterations  3  and  4  a 
certain  density  matrix  element  was  0.55  and  0.78,  respectively,  then  the 
element  would  be  set  to  0.60  (=0.55+0.05)  on  iteration  4.  The  density 
matrix  from  iteration  4  would  then  be  used  in  the  construction  of  the 
next  Fock  matrix.  The  arrays  which  hold  the  old  density  matrices  are 
not  filled  until  after  iteration  2.  For  this  reason  they  are  not  used 
in  the  damping  before  iteration  3. 

(c)  Three-point  interpolation  of  the  density  matrix.  Subroutine 
CNVG  monitors  the  number  of  iterations,  and  if  this  is  exactly  divisible 
by  three,  and  certain  other  conditions  relating  to  the  density  matrices 
are  satisfied,  a  three-point  interpolation  is  performed.  This  is  the 
default  converger,  and  is  very  effective  with  normally  convergent 
calculations.  It  fails  in  certain  systems,  usually  those  where 
significant  charge  build-up  is  present. 

(d)  Energy-level  shift  technique.  The  virtual  M.O.  energy  levels 
are  normally  shifted  to  more  positive  energy.  This  has  the  effect  of 
damping  oscillations,  and  intrinsically  divergent  equations  can  often  be 
changed  to  intrinsically  convergent  form.  With  slowly-convergent 
systems  the  virtual  M.O.  energy  levels  can  be  moved  to  a  more  negative 
value. 


The  precise  value  of  the  shift  used  depends  on  the  behavior  of  the 
iteration  energy.  If  it  is  dropping,  then  the  HOMO-LUMO  gap  is  reduced, 
if  the  iteration  energy  rises,  the  gap  is  increased  rapidly. 

(e)  Pulay's  method.  If  requested,  when  the  largest  change  in 
density  matrix  elements  on  two  consecutive  iterations  has  dropped  below 
0.1,  then  routine  CNVG  is  abandoned  in  favor  of  a  multi-Fock  matrix 
interpolation.  This  relies  on  the  fact  that  the  eigenvectors  of  the 
density  and  Fock  matrices  are  identical  at  self-consistency,  so  [P.F]=0 
at  SCF.  The  extent  to  which  this  condition  does  not  occur  is  a  measure 
of  the  deviance  from  self-consistency.  Pulay's  method  uses  this 
relationship  to  calculate  that  linear  combination  of  Fock  matrices  which 
minimize  [P.F]'.  This  new  Fock  matrix  is  then  used  in  the  SCF 
calculation. 

Under  certain  circumstances,  Pulay's  method  can  cause  very  slow 
convergence,  but  sometimes  it  is  the  only  way  to  achieve  a 
self-consistent  field.  At  other  times  the  procedure  gives  a  ten-fold 
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increase  in  speed,  so  care  must  be  exercised  in  its  use.  (invoked  by 
the  keyword  PULAY) 

(£)  The  Camp-King  converger.  If  all  else  fails,  the  Camp-King 
converger  is  just  about  guaranteed  to  work  every  time.  However,  it  is 
time-consuming,  and  therefore  should  only  be  invoked  as  a  last  resort. 

It  evaluates  that  linear  combination  of  old  and  current 
eigenvectors  which  minimize  the  total  energy.  One  of  its  strengths  is 
that  systems  which  otherwise  oscillate  due  to  charge  surges,  e.g. 
CHO-H,  the  C-H  distance  being  very  large,  will  converge  using  this  very 
sophisticated  converger. 


6.7  CAUSES  OP  FAILURE  TO  ACHIEVE  AN  SCF 

In  a  system  where  a  biradical  can  form,  such  as  ethane  decomposing 
into  two  CH3  units,  the  normal  RHF  procedure  can  fail  to  go 
self-consistent.  If  the  system  has  marked  biradicaloid  character,  then 
BIRADICAL  or  UHF  and  TRIPLET  can  often  prove  successful.  These  options 
rely  on  the  assumption  that  two  unpaired  electrons  can  represent  the 
open  shell  part  of  the  wave-function. 

Consider  H-Cl,  with  the  interatomic  distance  being  steadily 
increased.  At  first  the  covalent  bond  will  be  strong,  and  a 
self-consistent  field  is  readily  obtained.  Gradually  the  bond  will 
become  more  ionic,  and  eventually  the  charge  on  chlorine  will  become 
very  large.  The  hydrogen,  meanwhile,  will  become  very  electropositive, 
and  there  will  be  an  increased  energy  advantage  to  any  one  electron  to 
transfer  from  chlorine  to  hydrogen.  If  this  in  fact  occurred,  the 
hydrogen  would  suddenly  become  very  electron-rich  and  would,  on  the  next 
iteration,  lose  its  extra  electron  to  the  chlorine.  A  sustained 
oscillation  would  then  be  initiated.  To  prevent  this,  if  BIRADICAL  is 
specified,  exactly  one  electron  will  end  up  on  hydrogen.  A  similar 
result  can  be  obtained  by  specifying  TRIPLET  in  a  UHF  calculation. 


6.8  TORSION  OR  DIHEDRAL  ANGLE  COHERENCY 

MOPAC  calculations  do  not  distinguish  between  enantiomers, 
consequently  the  sign  of  the  dihedrals  can  be  multiplied  by  -1  and  the 
calculations  will  be  unaffected.  However,  if  chirality  is  important,  a 
user  should  be  aware  of  the  sign  convention  used. 

The  dihedral  angle  convention  used  in  MOPAC  is  that  defined  by 
Klyne  and  Prelog  in  Experientia  16,  521  (1960).  In  this  convention, 
four  atoms,  AXYB,  with  a  dihedral  angle  of  90  degrees,  will  have  atom  B 
rotated  by  90  degrees  clockwise  relative  to  A  when  X  and  Y  are  lined  up 
in  the  direction  of  sight,  X  being  nearer  to  the  eye.  In  their  words, 
"To  distinguish  between  enantiomeric  types  the  angle  'tau'  is  considered 
as  positive  when  it  is  measured  clockwise  from  the  front  substituent  A 
to  the  rear  substituent  B,  and  negative  when  it  is  measured 
anticlockwise."  The  alternative  convention  was  used  in  all  earlier 
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programs,  including  QCPE  353. 


6.9  VIBRATIONAL  ANALYSIS 

Analyzing  normal  coordinates  is  very  tedious.  Users  are  normally 
familiar  with  the  internal  coordinates  of  the  system  they  are  studying, 
but  not  familiar  with  the  cartesian  coordinates.  To  help  characterize 
the  normal  coordinates,  a  very  simple  analysis  is  done  automatically, 
and  users  are  strongly  encouraged  to  use  this  analysis  first,  and  then 
to  look  at  the  normal  coordinate  eigenvectors. 

In  the  analysis,  each  pair  of  bonded  atoms  is  examined  to  see  if 
there  is  a  large  relative  motion  between  them.  By  bonded  is  meant 
within  the  Van  der  Waals'  distance.  If  there  is  such  a  motion,  the 
indices  of  the  atoms,  the  relative  distance  in  Angstroms,  and  the 
percentage  radial  motion  are  printed.  Radial  plus  tangential  motion 
adds  to  100%,  but  as  there  are  two  orthogonal  tangential  motions  and 
only  one  radial,  the  radial  component  is  printed. 


6.10  A  NOTE  ON  THERMOCHEMISTRY 


By 

Tsuneo  Hirano 

Department  of  Synthetic  Chemistry 
Faculty  of  Engineering 
University  of  Tokyo 
Hongo,  Bunkyo-ku,  Tokyo,  Japan 

1)  Basic  Physical  Constants 

"Quantities,  Units  and  Symbols  in  Physical  Chemistry," 

Blackwell  Scientific  Publications  Ltd,  Oxford  0X2  OEL,  UK,  1987 
( IUPAC,  based  on  CODATA  of  ICSU,  1986).  pp  81-82. 


Speed  of  light 

c 

= 

2.997 

92458  D10 

cm/s  (Definition) 

Boltzmann  constant 

k 

= 

R/Na 

= 

1.380 

658  D-23 

J/K 

= 

1.380 

658  D-16 

erg/K 

Planck  constant 

h 

a 

6.626 

0755  D-34 

J  s 

= 

6.626 

0755  D-27 

erg  s 

Gas  constant 

R 

= 

8.314 

510  J/mol/K 

= 

1.987 

216  cal/raol/K 
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Avogadro  number  Na  =  6.022  1367  D23  /mol 

Volume  of  1  mol  of  gas  VO  =  22.414  10  1/mol  (at  1  atm,  25  C) 

1  J  =  1.D7  erg 

1  kcal  =  4.184  kJ  (Definition) 

1  ev  =  23.060  6  kcal/mol 

1  a.u.  =  27.211  35  eV/mol  =  627.509  6  kcal/mol 
1  cm-1  =  2.859  144  cal/mol  (=  Na  h  c  /  4.184D7) 

1  atm  =  1.013  25  D5  Pa  =  1.013  25  D6  dyn/cm**2  (Definition) 

****** 


Moment  of  inertia:  I 

1  amu  angstrom**2  =  1.660  540  D-40  g  cm**2 

Rotational  constants:  A,  B,  and  C  (e.g.  A  -  h/( 8*pi*pi*I ) ) 

A(in  MHz)  =  5.053  791  D5  /  I (in  amu  angstrom**2) 

A(in  cm-1)  =  5.053  791  D5/  c/  I (in  amu  angstrom**2) 

=  16.857  63  /  I (in  amu  angstrom**2) 

2)  Thermochemistry  from  ab  initio  MO  methods. 

Ab  initio  MO  methods  provide  total  energies,  Eeq,  as  the  sum  of 
electronic  and  nuclear-nuclear  repulsion  energies  for  molecules, 
isolated  in  vacuum,  without  vibration  at  0  K. 

Eeq  =  Eel  +  Enuclear-nuclear  (1) 


From  the  0  K-potential  surface  and  using  the  harmonic  oscillator 
approximation,  we  can  calculate  the  vibrational  frequencies,  vi,  of  the 
normal  modes  of  vibration.  Using  these,  we  can  calculate  vibrational, 
rotational  and  translational  contributions  to  the  thermodynamic 
quantities  such  as  the  partition  function  and  heat  capacity  which  arise 
from  heating  the  system  from  0  to  T  K. 

Q:  partition  function 
E:  energy 
S:  entropy 
C:  heat  capacity 
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[Vibration] 

Qvib  =  sum  over  i  {  1/(1  -  exp(-hvi/kT) )}  (2) 

Evib,  for  a  molecule  at  the  temperature  T  as 

Evib  =  sum  over  i  {(l/2)hvi  +  hvi*exp(-hvi/kT)/(l  -  exp(-hvi/kT) ) } 

(3) 

where  h  is  the  Planck  constant,  vi  the  i-th  normal  vibration  frequency, 
and  k  the  Boltzmann  constant.  For  1  mole  of  molecules,  Evib  should  be 
multiplied  by  the  Avogadro  number  Na(-  gas  constant  R/k).  Thus, 

Evib  =  Na  *  sum  over  i  {(l/2)hvi 

+  hvi*exp(-hvi/kT)/(l-exp(-hvi/kT) ) }  (4) 

Note  that  the  first  term  in  Eq.  4  is  the  Zero-point  vibration 

energy.  Hence,  the  second  term  in  Eq.  4  is  the  additional  vibrational 
contribution  due  to  the  temperature  increase  from  0  K  to  T  K.  Namely, 

Evib  =  Ezero  +  Evib(0 — >T)  (5) 

Ezero  =  Na  *  sum  over  i  {(l/2)hvi),  (6) 

Evib(0 — >T)  =  Na  *  sum  over  i  {hvi*exp(-hvi/kT)/(l  -  exp(-hvi/kT) ) } . 

(7) 

The  value  of  Evib  from  GAUSSIAN  82  and  86  includes  Ezero  as  defined  by 
Eqs.  4-7. 

Svib  =  R  sum  over  i  { (hvi/kT)*exp(-hvi/kT)/(l  -  exp(-hvi/kT) ) 

-  ln(l  -  exp(-hvi/kT) )}  (8) 

Cviv  =  R  sum  over  i  { ( (hvi/kT)**2)  exp(-hvi/kT)/ 

(1  -  exp(-hvi/kT) )**2}  (9) 

At  temperature  T  (>0  K),  a  molecule  rotates  about  the  x,  y,  and 
z-axes  and  translates  in  x,  y,  and  z-directions.  By  assuming  the 
equipartition  of  energy,  energies  for  rotation  and  translation,  Erot  and 
Etr,  are  calculated. 
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(Rotation] 

(sym)  is  symmetry  number. 

I  is  moment  of  inertia. 

IAr  IB,  and  IC  are  moments  of  inertia  about  A,  B,  and  C  axes. 


<linear  molecule> 

Qrot  =  (l/(sym) ) (8{pi**2)  I  kT/  h**2]  (10) 

Erot  =  ( 2/2 )RT  (11) 

Srot  =  R  In  [ (l/(sym) ) * ( 8 ( pi *  *  2 )  I/h**2)*kT]  +  R  (12) 

=  R  In  I  +  R  In  T  -  R  ln(sym)  -  4.349  203  (13) 

where 

-4.349  203  =  R  ln{8(pi**2) (1/Na) (l.D-8)**2  k/h**2>  +  R. 

Crot  «  ( 2/2)R  (14) 

<non-linear  molecule> 


Qrot  =  (pi**0.5/(sym) )  (8(pi**2)kT/h**2]**(3/2)  (IA  IB  IC)**(l/2) 
=  (pi**0 . 5/( sym) )  { ( S(pi**2) ( IA)c/h] * [8 (pi**2) ( IB)c/h]* 


(8(pi**2) (IC)c/h]}**(l/2)  (kT/hc)**(3/2)  (15) 

Erot  =  ( 3/2 )RT  (16) 

Srot  =  (R/2 )  In  { (pi/(sym)**2)  (8(pi**2) (IA)c/h]* 

[8 (pi** 2) (IB)c/h]*(8(pi**2) (IC)c/h]*[ (kT/hc)**3]} 

+  (3/2)R  (17) 

=  (R/2)  In  (IA  IB  IC)  +  (3/2)  R  In  T  -  R  In  (sym)  -  5.386  3921 
Here,  -5.386  3921  is  calculated  as 

R  In  { ( ( ( 1 .D-8) **2/Na) **3) **(1/2)  (2**9  pi**7  k*3)**(l/2)  /  h**3}  +  (3/2)R. 
Crot  =  ( 3/2 )R  (18) 
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[Translation] 

M  is  Molecular  weight. 

Qtra  =  {[2  (pi)  (M/Na)  kT]**(l/2)  /  h}**3  (19) 

Etra  =  ( 3/2 )RT  (20) 

(  or  Htra  =  (5/2)RT  due  to  the  pV  terra  (cf.  H  =  U  +  pV)  ) 
Stra  =  R  {  (5/2)  +  (3/2)ln  [ 2 ( pi ) k/h* * 2 ]  +  In  k  +  (3/2)ln  (M/Na) 

+  ( 5/2)ln  T  -  In  p  }  (21) 

■  ( 5/2)R  In  T  +  ( 3/2)R  In  M  -  R  In  p  -  2.31482  (22) 

Ctra  =  ( 5/2 )R  (23) 

The  internal  energy  U  at  T  is 

U  =  Eeq  +  [Evib  +  Erot  +  Etra]  (24) 

or 

U  =  Eeq  +  [(Ezero  +  Evib(0 — >T ) )  +  Erot  +  Etra]  (25) 

Enthalpy  H  for  one  mole  of  gas  is  defined  as 

H  =  U  +  pV  (26) 

Assumption  of  an  ideal  gas  (i.e.,  pV  =  RT)  leads  to 

H  =  U  +  pV  =  U  +  RT  (27) 

Thus,  Gibbs  free  energy  G  can  be  calculated  as 

G  =  H  -  T*S( 0 — >T)  (28) 


2) Thermochemistry  in  MOPAC 

It  should  be  noted  that  MO  parameters  for  MINDO/3,  MNDO,  AMI  and 
PM3  are  optimized  so  as  to  reproduce  the  experimental  heat  of  formation 
(i.e.,  standard  enthalpy  of  formation  or  the  enthalpy  change  to  form  a 
mole  of  compound  at  25  degrees  C  from  its  elements  in  their  standard 
state)  as  well  as  observed  geometries  (mostly  at  25  degrees  C),  and  not 
to  reproduce  the  Eeq  and  equilibrium  geometry  at  0  K. 

In  this  sense,  Escf  (defined  as  Beat  of  formation),  force 
constants,  normal  vibration  frequencies  etc  are  all  related  to  the 
values  at  25  degree  C,  not  to  0  Kill  II  Therefore,  the  Ezero  calculated 
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in  FORCE  is  not  the  true  Ezero.  Its  use  as  Ezero  should  be  made  at  your 
own  risk,  bearing  in  mind  the  situation  discussed  above. 

Since  Escf  is  standard  enthalpy  of  formation  (at  25  degree  C), 

Escf  =  lEeq  +  Ezero  +  Evib{0 — >298.15)  +  Erot  +  Etra  +  pV] 

+  sum  (  -  Electronic  energy  of  atom  +  Delta-H  of  formation  of  atom]. 

(29) 

To  avoid  the  complication  arising  from  the  definition  of  Escf,  within 
the  thermodynamics  calculation  the  Standard  Enthalpy  of  Formation, 
Delta-H,  is  calculated  by 


Delta-H  =  Escf  +  (HT  -  H298).  (30) 

Here,  Escf  is  the  heat  of  formation  (at  25  degree  C)  given  in  the  output 
list,  and  HT  and  H298  are  the  enthalpy  contributions  for  the  increase  of 
the  temperature  from  0  K  to  T  and  298.15,  respectively.  In  other  words, 
the  enthalpy  of  formation  is  corrected  for  the  difference  in  temperature 
from  298.15  K  to  T.  The  method  of  calculation  for  T  and  H298  will  be 


given  below. 

In  MOPAC,  the  variables  defined  below  are  used 

Cl  =  he  /  kT  (31) 

Wi  (in  cm-1)  (i.e.,  vi  =  Wi*c  ),  (32) 

EWJ  =  exp(  -hvi/kT)  =  exp(  -Wi*hc/kT)  =  exp(-  Wi*Cl)  (33) 

A,  B,  and  C  in  cm-1  (i.e.,  A  =  [h/(8(pi**2)IA*c) ] ) ,  (34) 


Energy  and  Enthalpy  in  cal/mol,  and  Entropy  in  cal/mol/K. 

Thus,  eqs.  2-28  can  be  written  as  follows. 

[Vibration] 

Qviv  =  pi  over  i  {l  /  (1  -  EWJ)}  (35) 

E0  =  [0.5  Na  h  c/( 4.184  D7)]  sum  over  i  {wi}  (36) 

=  1.429  572  *  sum  over  i  {wi}  (37) 
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Evib(0->T)  =  Na  h  c  sum  over  i  {  Wi*EWJ  /  (1  -  EWJ)  } 

=  (R/k)  h  c  sum  over  i  {  Wi*EWJ  /  (1  -  EWJ)  }  (38) 

Svib  =  R  (hc/kT)  sum  over  i  {WI*EWJ/(1  -  EWJ)} 

-  R  sum  over  i  {in  (1  -  EWJ)} 

=  R  Cl  sum  over  i  {WI*EWJ/(1  -  EWJ)} 

-  R  sum  over  i  {in  (1  -  EWJ)}  (39) 

Cvib  =  R  (hc/kT) **2  sum  over  i  {  Wi**2  EWJ  /(l-  EWJ)**2  } 

=  R  Cl**2  sum  over  i  {  Wi**2  EWJ  / ( 1—  EWJ)**2  }  (40) 


[Rotation] 

<linear  molecule> 

Qrot  =  (1/ (sym) )  (1/A)  (kT/hc) 

=  1/ [ (sym)  A  Cl]  (41) 

Erot  =  ( 2/2 )RT  (42) 

Srot  =  R  In  [ (l/(sym) )  (1/A)  (kT/hc)]  +  R 
=  R  In  [l/(  ( sym) *A*C1  ) ]  +  R 

=  R  In  [  Tk/(  h*c*A*sym)]  +  R  (43) 

Crot  =  ( 2/2 )R  (44) 

<non-linear  molecule> 

Qrot  =  [  pi  /  (A  B  C  Cl**3 ) ] ** ( 1/2 )  /  (sym)  (45) 

Erot  =  ( 3/2 )RT  (46) 


Srot  =  (R/2 )  In  {  (l/(sym)**2)  (1/A)  (1/B)  (1/C)  (pi)  (kT/hc)**3  } 
+  ( 3/2)R 

=  0.5R  {  3  In  (kT/hc)  -  2  In  (sym)  +  In  (pi/(A  B  C))  +  3} 

(47) 

=  0.5R  {  -3  In  Cl  -2  In  (sym)  +  In  (pi/(A  B  C))  +  3} 


Crot  =  (3/2)R 


(48) 
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[Translation] 

Qtra  =  [  (  2  pi  (M/Na)  kT)**(l/2)  /  h]**3 

=  [  (2  pi  H  k  T  *  1.660540D-24)**(l/2)  /h]**3  (49) 

Etra  =  ( 3/2JRT  (50) 

Htra  =  ( 3/2 )RT  +  pV  =  (5/2)RT  (cf.  pV  =  RT)  (51) 

Stra  =  (R/2)  [  5  In  T  +  3  In  M  ]  -  2.31482  (cf.  p  =  1  atm) 

=  0.993608  [  5  In  T  +  3  In  M]  -  2.31482  (52) 

In  NOPAC  , 

Hvib  =  Evib(0 — >T)  (53) 

(Note:  Ezero  is  <not>  included  in  Hvib. 


Wi  is  not  derived  from  force-constants  at  0  K) 

and 

HT  =  [Hvib  +  Hrot  +  Htra]  for  T.  (54) 

H298  =  [Hvib  +  Hrot  +  Htra]  for  T  =  298.15.  (55) 

Note  that  HT  (and  H298)  is  equivalent  to 

[(Evib  -  Ezero)  +  Erot  +  (Etra  +  pV) ]  (56) 

except  that  the  normal  frequencies  are  those  obtained  from  force 
constants  at  25  degree  C,  or  at  least  not  at  0  K. 

Thus,  Standard  Enthalpy  of  Formation,  Delta-H,  can  be  calculated 
according  to  Eqs.  25,  26  and  29,  as  shown  in  Eq.  30; 

Delta-H  =  Escf  +  (HT  -  H298)  (57) 

Note  that  Ezero  is  already  counted  in  Escf  (see  Eq.  29). 

By  using  Eq.  27,  Standard  Internal  Energy  of  Formation,  Delta-U, 
can  be  calculated  as 

Delta-U  =  Delta-H  -  R(T  -  298.15).  (58) 

Standard  Gibbs  Free-Energy  of  Formation,  Delta-G,  can  be  calculated 
by  taking  the  difference  from  that  for  the  isomer  or  that  at  different 
temperature. 
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Delta-G  =  [Delta-H  -  T*S]  for  the  state  under  consideration 

-  [Delta-H  -  T*S]  for  reference  state.  (59) 

Taking  the  difference  is  necessary  to  cancel  the  unknown  values  of 
standard  entropy  of  formation  for  the  constituent  elements. 


6.11  REACTION  COORDINATES 

The  Intrinsic  Reaction  Coordinate  method  pioneered  and  developed  by 
Mark  Gordon  has  been  incorporated  in  a  modified  form  into  MOPAC.  As 
this  facility  is  quite  complicated  all  the  keywords  associated  with  the 
IRC  have  been  grouped  together  in  this  section. 


Definitions  of  Terms 


DRC 

The  Dynamic  Reaction  Coordinate  is  the  path  followed  by  all  the 
atoms  in  a  system  assuming  conservation  of  energy,  i.e.,  as  the 
potential  energy  changes  the  kinetic  energy  of  the  system  changes  in 
exactly  the  opposite  way  so  that  the  total  energy  (kinetic  plus 
potential)  is  a  constant.  If  started  at  a  ground  state  geometry,  no 
significant  motion  should  be  seen.  Similarly,  starting  at  a  transition 
state  geometry  should  not  produce  any  motion  -  after  all  it  is  a 
stationary  point  and  during  the  lifetime  of  a  calculation  it  is  unlikely 
to  accumulate  enough  momentum  to  travel  far  from  the  starting  position. 

In  order  to  calculate  the  DRC  path  from  a  transition  state,  either 
an  initial  deflection  is  necessary  or  some  initial  momentum  must  be 
supplied. 

Because  of  the  time-dependent  nature  of  the  DRC  the  time  elapsed 
since  the  start  of  the  reaction  is  meaningful,  and  is  printed. 


Description 

The  course  of  a  molecular  vibration  can  be  followed  by  calculating 
the  potential  and  kinetic  energy  at  various  times.  Two  extreme 
conditions  can  be  identified:  (a)  gas  phase,  in  which  the  total  energy 
is  a  constant  through  time,  there  being  no  damping  of  the  kinetic  energy 
allowed,  and  (b)  liquid  phase,  in  which  kinetic  energy  is  always  set  to 
zero,  the  motion  of  the  atoms  being  infinitely  damped. 

All  possible  degrees  of  damping  are  allowed.  In  addition,  the 
facility  exists  to  dump  energy  into  the  system,  appearing  as  kinetic 
energy.  As  kinetic  energy  is  a  function  of  velocity,  a  vector  quantity, 
the  energy  appears  as  energy  of  motion  in  the  direction  in  which  the 
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molecule  would  naturally  move.  If  the  system  is  a  transition  state, 
then  the  excess  kinetic  energy  is  added  after  the  intrinsic  kinetic 
energy  has  built  up  to  at  least  0.2Kcal/mole. 

For  ground-state  systems,  the  excess  energy  sometimes  may  not  be 
added;  if  the  intrinsic  kinetic  energy  never  rises  above  0.2kcal/mole 
then  the  excess  energy  will  not  be. added. 


Equations  Used 
Force  acting  on  any  atom 

g(i)  +  g'(i)t  +  g"(i)t**2  =  dE/dx(i)  +  d**2E/dx(i)**L  +d**3E/dx( i ) **3 
Acceleration  due  to  force  acting  on  each  atom 
a(i)  *  (g(i)+g'(i)t+g"(i)t**2)/M(i) 

New  velocity 

V(o)+Dt*g(i)/M(i)+l/2*Dt**2*g,{i)/M(i)+/3*Dt**3*g"(i)/M(i) 

or 

V(i )  =  V(i )  +  V’(i)t  +  V"(i)t**2  +  V'"(i)t**3 

That  is,  the  change  in  velocity  is  equal  to  the  integral  over  the 
time  interval  of  the  acceleration. 

New  position  of  atoms 

X(i)  =  X(o)  +  V(o)t  +  l/2*V't**2  +  l/3*V"t**3  +  l/4*V,"t**4 

That  is,  the  change  in  position  is  equal  to  the  integral  over  the 
time  interval  of  the  velocity. 

The  velocity  vector  is  accurate  to  the  extent  that  it  takes  into 
account  the  previous  velocity,  the  current  acceleration,  the  predicted 
acceleration,  and  the  change  in  predicted  acceleration  over  the  time 
interval.  Very  little  error  is  introduced  due  to  higher  order 
contributions  to  the  velocity;  those  that  do  occur  are  absorbed  in  a 
re-normalization  of  the  magnitude  of  the  velocity  vector  after  each  time 
interval. 

The  magnitude  of  Dt,  the  time  interval,  is  determined  mainly  by  the 
factor  needed  to  re-normalize  the  velocity  vector.  If  it  is 
significantly  different  from  unity,  Dt  will  be  reduced;  if  it  is  very 
close  to  unity,  Dt  will  be  increased. 

Even  with  all  this,  errors  creep  in  and  a  system,  started  at  the 
transition  state,  is  unlikely  to  return  precisely  to  the  transition 
state  unless  an  excess  kinetic  energy  is  supplied,  for  example 
0 . 2Kcal/mole . 
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The  calculation  is  carried  out  in  cartesian  coordinates,  and 
converted  into  internal  coordinates  for  display.  All  cartesian 
coordinates  must  be  allowed  to  vary,  in  order  to  conserve  angular  and 
translational  momentum. 


IRC 

The  Intrinsic  Reaction  Coordinate  is  the  path  followed  by  all  the 
atoms  in  a  system  assuming  all  kinetic  energy  is  completely  lost  at 
every  point,  i.e.,  as  the  potential  energy  changes  the  kinetic  energy 
generated  is  annihilated  so  that  the  total  energy  (kinetic  plus 
potential)  is  always  equal  to  the  potential  energy  only. 

The  IRC  is  intended  for  use  starting  with  the  transition  state 
geometry.  A  normal  coordinate  is  chosen,  usually  the  reaction 
coordinate,  and  the  system  is  displaced  in  either  the  positive  or 
negative  direction  along  this  coordinate.  The  internal  modes  are 
obtained  by  calculating  the  mass-weighted  Hessian  matrix  in  a  force 
calculation  and  translating  the  resulting  cartesian  normal  mode 
eigenvectors  to  conserve  momentum.  That  is,  the  initial  cartesian 
coordinates  are  displaced  by  a  small  amount  proportional  to  the 
eigenvector  coefficients  plus  a  translational  constant;  the  constant  is 
required  to  ensure  that  the  total  translational  momentum  of  the  system 
is  conserved  as  zero.  At  the  present  time  there  may  be  small  residual 
rotational  components  which  are  not  annihilated;  these  are  considered 
unimportant. 


General  Description  of  the  DRC  and  IRC. 


As  the  IRC  usually  requires  a  normal  coordinate,  a  force  constant 
calculation  normally  has  to  be  done  first.  If  IRC  is  specified  on  its 
own  a  normal  coordinate  is  not  used  and  the  IRC  calculation  is  performed 
on  the  supplied  geometry. 

A  recommended  sequence  of  operations  to  start  an  IRC  calculation  is 
as  follows: 

1.  Calculate  the  transition  state  geometry.  If  the  T/S  is  not 
first  optimized,  then  the  IRC  calculation  may  give  very 
misleading  results.  For  example,  if  NH3  inversion  is  defined 
as  the  planar  system  but  without  the  N-H  bond  length  being 
optimized  the  first  normal  coordinate  might  be  for  N-H  stretch 
rather  than  inversion.  In  that  case  the  IRC  will  relax  the 
geometry  to  the  optimized  planar  structure. 

2.  Do  a  normal  FORCE  calculation,  specifying  ISOTOPE  in  order  to 
save  the  FORCE  matrices.  Do  not  attempt  to  run  the  IRC 
directly  unless  you  have  confidence  that  the  FORCE  calculation 
will  work  as  expected.  If  the  IRC  calculation  is  run  directly, 
specify  ISOTOPE  anyway:  that  will  save  the  FORCE  matrix  and  if 
the  calculation  has  to  be  re-done  then  RESTART  will  work 
correctly. 
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3.  Using  IRC=n  and  RESTART  cun  the  IRC  calculation.  If  RESTART  is 
specified  with  IRC-n  then  the  restart  is  assumed  to  be  from  the 
FORCE  calculation.  If  RESTART  is  specified  without  IRC=n,  say 
with  IRC  on  its  own,  then  the  restart  is  assumed  to  be  from  an 
earlier  IRC  calculation  that  was  shut  down  before  going  to 
completion. 


A  DRC  calculation  is  simpler  in  that  a  force  calculation  is  not  a 
prerequisite;  however,  most  calculations  of  interest  normally  involve 
use  of  an  internal  coordinate.  For  this  reason  IRC=n  can  be  combined 
with  DRC  to  give  a  calculation  in  which  the  initial  motion  (0.3Kcal 
worth  of  kinetic  energy)  is  supplied  by  the  IRC,  and  all  subsequent 
motion  obeys  conservation  of  energy.  The  DRC  motion  can  be  modified  in 
three  ways: 

1.  It  is  possible  to  calculate  the  reaction  path  followed  by  a 
system  in  which  the  generated  kinetic  energy  decays  with  a 
finite  half-life.  This  can  be  defined  by  DRC=n.nnn,  where 
n.nnn  is  the  half-life  in  femtoseconds.  If  n.nn  is  0.0  this 
corresponds  to  infinite  damping  simulating  the  IRC.  A 
limitation  of  the  program  is  that  time  only  has  meaning  when 
DRC  is  specified  without  a  half-life. 

2.  Excess  kinetic  energy  can  be  added  to  the  calculation  by  use  of 
KINETIC=n.nn.  After  the  kinetic  energy  has  built  up  to 
0.2Kcal/mole  or  if  IRC=n  is  used  then  n.nn  Kcal/mole  of  kinetic 
energy  is  added  to  the  system.  The  excess  kinetic  energy 
appears  as  a  velocity  vector  in  the  same  direction  as  the 
initial  motion. 

3.  The  RESTART  file  <f ilename> .RES  can  be  edited  to  allow  the  user 
to  modify  the  velocity  vector  or  starting  geometry.  This  file 
is  formatted. 


Frequently  DRC  leads  to  a  periodic,  repeating  orbit.  One  special 
type  -  the  orbit  in  which  the  direction  of  motion  is  reversed  so  that 
the  system  retraces  its  own  path  -  is  sensed  for  and  if  detected  the 
calculation  is  stopped  after  exactly  one  cycle.  If  the  calculation  is 
to  be  continued,  the  keyword  GEO-OK  will  allow  this  check  to  be 
by-passed. 

Due  to  the  potentially  very  large  output  files  that  the  DRC  can 
generate  extra  keywords  are  provided  to  allow  selected  points  to  be 
printed.  After  the  system  has  changed  by  a  preset  amount  the  following 
keywords  can  be  used  to  invoke  a  print  of  the  geometry. 
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Keyword  Default  User  Specification 

X-PRIO  0.05  Angstroms  X-PRIORITY=n.nn 

T-PRIO  0.10  Femtoseconds  T-PRIORITY=n . nn 

H-PRIO  0.10  Kcal/mole  H-PRIORITY=n . nn 

Option  to  allow  only  extrema  to  be  output 

In  the  geometry  specification,  if  an  internal  coordinate  is  marked 
for  optimization  then  when  that  internal  coordinate  passes  through  an 
extremum  a  message  will  be  printed  and  the  geometry  output. 

Difficulties  can  arise  from  the  way  internal  coordinates  are 
processed.  The  internal  coordinates  are  generated  from  the  cartesian 
coordinates,  so  an  internal  coordinate  supplied  may  have  an  entirely 
different  meaning  on  output.  In  particular  the  connectivity  may  have 
changed.  For  obvious  reasons  dummy  atoms  should  not  be  used  in  the 
supplied  geometry  specification.  If  there  is  any  doubt  about  the 
internal  coordinates  or  if  the  starting  geometry  contains  dummy  atoms 
then  run  a  1SCF  calculation  specifying  XYZ.  This  will  produce  an  ARC 
file  with  the  "ideal"  numbering  -  the  internal  numbering  system  used  by 
MOPAC.  Use  this  ARC  file  to  construct  a  data  file  suitable  for  the  DRC 
or  IRC. 

Notes 

1.  Any  coordinates  marked  for  optimization  will  result  in  only 
extrema  being  printed. 

2.  If  extrema  are  being  printed  then  kinetic  energy  extrema  will 
also  be  printed. 


Keywords  for  use  with  the  IRC  and  DRC 

1.  Setting  up  the  transition  state:  NLLSQ  SIGMA  TS. 

2.  Constructing  the  FORCE  matrix:  FORCE  or  IRC=n,  ISOTOPE,  LET. 

3.  Starting  an  IRC:  RESTART  and  IRC*n,  T-PRIO,  X-PRIO,  H-PRIO. 

4.  Starting  a  DRC:  DRC  or  DRC=n.nn,  KINETIC=n.nn. 

5.  Starting  a  DRC  from  a  transition  state:  (DRC  or  DRC=n )  and 

IRC=n,  KINETIC=n. 

6.  Restarting  an  IRC:  RESTART  and  IRC. 

7.  Restarting  a  DRC:  RESTART  and  (DRC  or  DRC-n.nn). 

8.  Restarting  a  DRC  starting  from  a  transition  state:  RESTART  and 
( DRC  or  DRC-n . nn ) . 
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Other  keywords,  such  as  T=nnn  or  GEO-OK  can  be  used  anytime. 


Examples  of  DRC/1RC  data 


Use  of  the  IRC/DRC  facility  is  quite  complicated.  In  the  following 
examples  various  "reasonable"  options  are  illustrated  for  a  calculation 
on  water. 

It  is  assumed  that  an  optimized  transition-state  geometry  is 
available. 

Example  1:  A  Dynamic  Reaction  Coordinate,  starting  at  the 
transition  state  for  water  inverting,  initial  motion  opposite  to  the 
transition  normal  mode,  with  6kcal  of  excess  kinetic  energy  added  in. 
Every  point  calculated  is  to  be  printed  (Note  all  coordinates  are  marked 
with  a  zero,  and  T-PRIO,  H-PRIO  and  X-PRIO  are  all  absent).  The  results 
of  an  earlier  calculation  using  the  same  keywords  is  assumed  to  exist. 
The  earlier  calculation  would  have  constructed  the  force  matrix.  While 
the  total  cpu  time  is  specified,  it  is  in  fact  redundant  in  that  the 
calculation  will  run  to  completion  in  less  than  600  seconds. 


KINETIC=6  RESTART  IRC=-1  DRC  T=600 
WATER 


H 

O 

H 

0 


0.000000  0 

0.911574  0 

0.911574  0 

0.000000  0 


0.000000  0 

0.000000  0 

180.000000  0 

0.000000  0 


0.000000  0 

0.000000  0 

0.000000  0 

0.000000  0 


0  0  0 
10  0 
2  10 
0  0  0 


Example  2:  An  Intrinsic  Reaction  Coordinate  calculation.  Here  the 
restart  is  from  a  previous  IRC  calculation  which  was  stopped  before  the 
minimum  was  reached.  Recall  that  RESTART  with  lRC=n  implies  a  restart 
from  the  FORCE  calculation.  Since  this  is  a  restart  from  within  an  IRC 
calculation  the  keyword  IRC=n  has  been  replaced  by  IRC.  IRC  on  its  own 
(without  the  "-n")  implies  an  IRC  calculation  from  the  starting  position 
-  here  the  RESTART  position  -  without  initial  displacement. 
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RESTART  IRC  T=600 
WATER 


H 

0.000000 

0 

0.000000 

0 

0.000000 

0 

0 

0 

0 

0 

0.911574 

0 

0.000000 

0 

0.000000 

0 

1 

0 

0 

H 

0.911574 

0 

180.000000 

0 

0.000000 

0 

2 

1 

0 

0 

0.000000 

0 

0.000000 

0 

O.OOOOCO 

0 

0 

0 

0 

Output  Format 

for  IRC  and 

DRC 

The  IRC  and 

DRC 

can  produce 

several  different 

forms  of  output. 

Because  of  the  large  size  of  these  outputs,  users  are  recommended  to  use 
search  functions  to  extract  information.  To  facilitate  this,  specific 
lines  have  specific  characters.  Thus,  a  search  for  the  "%"  symbol  will 
summarize  the  energy  profile  while  a  search  for  "AA"  will  yield  the 
coordinates  of  atom  1,  whenever  it  is  printed.  The  main  flags  to  use  in 
searches  are: 


SEARCH  FOR 

'%  ' 

1  %M ' 

' %MAX 1 
' %MIN' 

•AA*' 

'AE*' 

' 123AB* 1 


YIELDS 

Energies  for  all  points  calculated, 

excluding  extrema 

Energies  for  all  turning  points 

Energies  for  all  maxima 

Energies  for  all  minima 

Energies  for  all  points  calculated 

Internal  coordinates  for  atom  1  for  every  point 

Internal  coordinates  for  atom  5  for  every  point 

Internal  coordinates  for  atom  5  for  point  123 


As  the  keywords  for  the  IRC/DRC  are  interdependent,  the  following 
list  of  keywords  illustrates  various  options. 


Keyword 


Resulting  Action 


DRC 

DRC=0 . 5 
DRC=-1 . 0 


IRC 


IRC=-4 

IRC=1  KINETIC=1 
DRC  KINETIC=5 


The  Dynamic  Reaction  Coordinate  is  calculated. 
Energy  is  conserved,  and  no  initial  impetus. 

In  the  DRC  kinetic  energy  is  lost  with  a  half- 
life  of  0.5  femtoseconds. 

Energy  is  put  into  a  DRC  with  an  half-life  of 
-1.0  femtoseconds,  i.e.,  the  system  gains 
energy. 

The  Intrinsic  Reaction  Coordinate  is 
calculated.  No  initial  impetus  is  given. 

Energy  not  conserved. 

The  IRC  is  run  starting  with  an  impetus  in  the 
negative  of  the  4th  normal  mode  direction.  The 
impetus  is  one  quantum  of  vibrational  energy. 
The  first  normal  mode  is  used  in  an  IRC,  with 
the  initial  impetus  being  1 .OKcal/mole. 

In  a  DRC,  after  the  velocity  is  defined,  5  Kcal 
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of  kinetic  energy  is  added  in  the  direction  of 
the  initial  velocity. 

IRC=1  DRC  KINETIC=4  After  starting  with  a  4  kcal  impetus  in  the 

direction  of  the  first  normal  mode,  energy  is 
conserved. 

DRC  VELOCITY  KINETIC=10  Follow  a  DRC  trajectory  which  starts  with  an 

initial  velocity  read  in,  normalized  to  a 
kinetic  energy  of  10  kcal/mol. 

Instead  of  every  point  being  printed,  the  option  exists  to  print 
specific  points  determined  by  the  keywords  T-PRIORITY,  X-PRIORITY  and 
H-PRIORITY .  If  any  one  of  these  words  is  specified,  then  the  calculated 
points  are  used  to  define  quadratics  in  time  for  all  variables  normally 
printed.  In  addition,  if  the  flag  for  the  first  atom  is  set  to  T  then 
all  kinetic  energy  turning  points  are  printed.  If  the  flag  for  any 
other  internal  coordinate  is  set  to  T  then,  when  that  coordinate  passes 
through  an  extremum,  that  point  will  be  printed.  As  with  the  PRIORITYs, 
the  point  will  be  calculated  via  a  quadratic  to  minimize  non-linear 
errors. 


N.b.:  Quadratics  are  unstable  in  the  regions  of  inflection  points, 
in  these  circumstances  linear  interpolation  will  be  used.  A  result  of 
this  is  that  points  printed  in  the  region  of  an  inflection  may  not 
correspond  exactly  to  those  requested.  This  is  not  an  error  and  should 
not  affect  the  quality  of  the  results. 


Test  of  DRC  -  Verification  of  Trajectory  Path 


Introduction:  Unlike  a  single-geometry  calculation  or  even  a 
geometry  optimization,  verification  of  a  DRC  trajectory  is  not  a  simple 
task.  In  this  section  a  rigorous  proof  of  the  DRC  trajectory  is 
presented;  it  can  be  used  both  as  a  test  of  the  DRC  algorithm  and  as  a 
teaching  exercise.  Users  of  the  DRC  are  asked  to  follow  through  this 
proof  in  order  to  convince  themselves  that  the  DRC  works  as  it  should. 

Part  1:  The  Nitrogen  Molecule 

For  the  nitrogen  molecule  and  using  MNDO,  the  equilibrium  distance 
is  1.103802  Angstroms,  the  heat  of  formation  is  8.276655  Kcal/mole  and 
the  vibrational  frequency  is  2739.6  cm(-l).  For  small  displacements, 
the  energy  curve  versus  distance  is  parabolic  and  the  gradient  curve  is 
approximately  linear,  as  is  shown  in  the  following  table.  A  nitrogen 
molecule  is  thus  a  good  approximation  to  a  harmonic  oscillator. 
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STRETCHING  CURVE  FOR  NITROGEN  MOLECULE 


H.O.F.  GRADIENT 

(Kcal/mole)  (Kcal/mole/Angstrom) 


N-N  DIST 
(Angstroms) 

1.1180 

1.1170 

1.1160 

1.1150 

1.1140 

1.1130 

1.1120 

1.1110 

1.1100 

1.1090 

1.1080 

1.1070 

1.1060 

1.1050 

1.1040 

1.1030 

1.1020 

1.1010 

1.1000 

1.0990 

1.0980 

1.0970 

1.0960 

1.0950 

1.0940 

1.0930 

1.0920 

1.0910 

1.0900 


8.714564 

8.655723 

8.601031 

8.550512 

8.504188 

8.462082 

8.424218 

8.390617 

8.361303 

8.336299 

8.315628 

8.299314 

8.287379 

8.279848 

8.276743 

8.278088 

8.283907 

8.294224 

8.309061 

8.328444 

8.352396 

8.380941 

8.414103 

8.451906 

8.494375 

8.541534 

8.593407 

8.650019 

8.711394 


60.909301 

56.770564 

52.609237 

48.425249 

44.218525 

39.988986 

35.736557 

31.461161 

27.162720 

22.841156 

18.496393 

14.128353 

9.736959 

5.322132 

0.883795 

-3.578130 

-8.063720 

-12.573055 

-17.106213 

-21.663271 

-26.244309 

-30.849404 

-35.478636 

-40.132083 

-44.809824 

-49.511939 

-54.238505 

-58.989621 

-63.765330 


Period  of  Vibration. 

The  period  of  vibration  (time  taken  for  the  oscillator  to  undertake 
one  complete  vibration,  returning  to  its  original  position  and  velocity) 
can  be  calculated  in  three  ways.  Most  direct  is  the  calculation  from 
the  energy  curve;  using  the  gradient  constitutes  a  faster,  albeit  less 
direct,  method,  while  calculating  it  from  the  vibrational  frequency  is 
very  fast  but  assumes  that  the  vibrational  spectrum  has  already  been 
calculated. 

(1)  From  the  energy  curve. 

For  a  simple  harmonic  oscillator  the  period  '  r'  is  given  by 
r  =  2*pi*sqrt(m/k) 

where  m  =  reduced  mass  and  k  =  force-constant.  The  reduced  mass  (in 
AMU)  of  a  nitrogen  molecule  is  14.0067/2  =  7.00335,  and  the 
force-constant  can  be  calculated  from 
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2 

E  -  C  =  l/2*k(R-Ro). 


Given  Ro  =  1.1038,  R  =  1.092,  c  = 

then 

2 

k  =  4548.2  Kcal/mole/A 


8.276655  and  E  =  8 . 593407Kcal/mol 


3  7  8  8  2 

=  4545  *  4.184  *  10  *  10  *  10  *  10  ergs/cm 

30  2 

=  1.9029  *  10  ergs/cm 


30 

Therefore,  r  *  2  *  3.14159  *  sqrt( 7. 0035/(1. 9029*10  ))  seconds 

-15 

*  12.054  *  10  seconds 
=  12.054  fS  (Femtoseconds) 

(2)  From  the  gradient  curve. 

The  force  constant  is  the  derivative  of  the  gradient  W.R.T. 
distance 

k  =  dG/dx 


Since  we  are  using  discrete  points,  the  force  constant  is  best 
obtained  from  finite  differences: 

k  =  (G2-Gl)/(x2-xl) 


For  x2  =  1.1100,  G2  =  27.163  and  for  xl  =  1.0980,  G1  =  -26.244, 

giving  rise  to  k  -  4450  kcal/mole/A/A  and  a  period  of  12.186  fS. 

(3)  From  the  vibrational  frequency. 


Given  a  "frequency"  of  vibration  of  N2  of  v=2739.6  cm(-l)  the 
period  of  oscillation  is  given  directly  by 


r  =  l/(v*c) 

=  1/(2739.6  *  2.998  *  10 


10 

)  seconds 


=  12.175  fS 
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Summarizing,  by  three  different  methods  the  period  of  oscillation 
of  N2  is  calculated  to  be  12.054,  12.186  and  12.175  fS,  average 
12.138fS. 


Initial  Dynamics  of  N2  Molecule  with  N-N  distance  =  1.094  Angstroms 


A  useful  check  on  the  dynamics  of  N2  is  to  calculate  the  initial 
acceleration  of  the  two  nitrogen  atoms  after  releasing  them  from  a 
starting  interatomic  separation  of  1.094  Angstroms. 


At  R(N-N) 
G 


Therefore  acceleration  f 


Distance  from  equilibrium 
After  0.1  fS  velocity 


1.094  Angstroms, 

-44.810  Kcal/mole/Angstrom 
19 

-18.749  *  10  ergs/cm 


19 

-18.749  *  10  /14.0067  cm/sec/sec 

18 

-13.386  *  10  cm/sec/sec 
15 

-13.386  *  10  *  Earth  surface  gravity! 

0.00980  Angstroms. 


-15  18 

0.1  *  10  *  {-13.386  *  10  )  cm/sec 


1338.6  cm/sec 


In  the  DRC  the  time-interval  between  points  calculated  is  a 
complicated  function  of  the  curvature  of  the  local  surface.  By  default, 
the  first  time-interval  is  0.105fS,  so  the  calculated  velocity  at  this 
time  should  be  0.105  *  1338.6  =  1405.6cm/sec,  in  the  DRC  calculation  the 
predicted  velocity  is  1405.6cm/sec. 

The  option  is  provided  to  allow  sampling  of  the  system  at  constant 
time-intervals,  the  default  being  O.lfS.  For  the  first  few  points  the 
calculated  velocities  are  as  follows. 
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TIME 

CALCULATED 

LINEAR 

DIFF. 

VELOCITY 

VELOCITY 

VELOCITY 

0.000 

0.0 

0.0 

0.0 

0.100 

1338.6 

1338.6 

0.0 

0.200 

2673.9 

2677.2 

-3.3 

0.300 

4001.0 

4015.8 

-14.8 

0.400 

5317.3 

5354.4 

-37.1 

0.500 

6618.5 

6693.0 

-74.5 

0.600 

7900.8 

8031.6 

-130.8 

As  the  calculated  velocity  is  a  fourth-order  polynomial  of  the 
acceleration,  and  the  acceleration,  its  first,  second  and  third 
derivatives,  are  all  changing,  the  predicted  velocity  rapidly  becomes  a 
poor  guide  to  future  velocities. 


For  simple  harmonic 

motion  the  velocity  at  any  time  is  given  by 

o 

> 

ii 

> 

*  sin(2*pi*t/r ) 

By  fitting 

the  computed  velocities  to  simple 

harmonic  motion,  a  much 

better  fit 

is  obtained 

Calculated 

Simple  Harmonic 

Diff 

Time 

Velocity 

25316. Sin(0.529t) 

0.000 

0.0 

0.0 

0.0 

0.100 

1338.6 

1338.6 

0.0 

0.200 

2673.9 

2673.4 

+0.5 

0.300 

4001.0 

4000.8 

+0.2 

0.400 

5317.3 

5317.0 

+0.3 

0.500 

6618.5 

6618.3 

+0.2 

0.600 

7900.8 

7901.0 

-0.2 

The  repeat-time  required  for  this  motion  is  11.88  fS,  in  good 
agreement  with  the  three  values  calculated  using  static  models.  The 
repeat  time  should  not  be  calculated  from  the  time  required  to  go  from  a 
minimum  to  a  maximum  and  then  back  to  a  minimum  —  only  half  a  cycle. 
For  all  real  systems  the  potential  energy  is  a  skewed  parabola,  so  that 
the  potential  energy  slopes  are  different  for  both  sides;  a  compression 
(as  in  this  case)  normally  leads  to  a  higher  force-constant,  and  shorter 
apparent  repeat  time  (as  in  this  case).  Only  the  addition  of  the  two 
half-cycles  is  meaningful. 
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Conservation  of  Normal  Coordinate 


So  far  this  analysis  has  only  considered  a  homonuclear  diatomic.  A 
detailed  analysis  of  a  large  polyatomic  is  impractical,  and  for 
simplicity  a  molecule  of  formaldehyde  will  be  studied. 

In  polyatomics,  energy  can  transfer  between  modes.  This  is  a 
result  of  the  non-parabolic  nature  of  the  potential  surface.  For  small 
displacements  the  surface  can  be  considered  as  parabolic.  This  means 
that  for  small  displacements  interconversion  between  modes  should  occur 
only  very  slowly.  Of  the  six  normal  modes,  mode  1,  at  1204.5  cm(-l), 
the  in-plane  C-H  asymmetric  bend,  is  the  most  unsymmetric  vibration,  and 
is  chosen  to  demonstrate  conservation  of  vibrational  purity. 

Mode  1  has  a  frequency  corresponding  to  3.44  Kcal/mole  and  a 
predicted  vibrational  time  of  27.69fS.  By  direct  calculation,  using  the 
DRC,  the  cycle  time  is  27.55fS.  The  rate  of  decay  of  this  mode  has  an 
estimated  half-life  of  a  few  thousands  femtoseconds. 


Rate  of  Decay  of  Starting  Mode 


For  trajectories  initiated  by  an  IRC=n  calculation,  whenever  the 
potential  energy  is  a  minimum  the  current  velocity  is  compared  with  the 
supplied  velocity.  The  square  of  the  cosine  of  the  angle  between  the 
two  velocity  vectors  is  a  measure  of  the  intensity  of  the  original  mode 
in  the  current  vibration. 


Half-Life  for  Decay  of  Initial  Mode 

Vibrational  purity  is  assumed  to  decay  according  to  zero' th  order 
kinetics.  The  half-life  is  thus  -0.6931472*t/log(psi2)  fS,  where  psi2 
is  the  square  of  the  overlap  integral  of  the  original  vibration  with  the 
current  vibration.  Due  to  the  very  slow  rate  of  decay  of  the  starting 
mode,  several  half-life  calculations  should  be  examined.  Only  when 
successive  half-lives  are  similar  should  any  confidence  be  placed  in 
their  value. 
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DRC  Print  Options 


The  amount  of  output  in  the  DRC  is  controlled  by  three  sets  of 
options.  These  sets  are: 

(a)  Equivalent  Keywords  H-PRIORITY,  T-PRIORITY,  and  X-PRIORITY 

(b)  Potential  Energy  Turning  Point  option. 

(c)  Geometry  Maxima  Turning  Point  options. 

If  T-PRIORITY  is  used  then  turning  points  cannot  be  monitored. 
Currently  H-PRIORITY  and  X-PRIORITY  are  not  implemented,  but  will  be  as 
soon  as  practical. 

To  monitor  geometry  turning  points,  put  a  "T"  in  place  of  the 
geometry  optimization  flag  for  the  relevant  geometric  variable. 

To  monitor  the  potential  energy  turning  points,  put  a  "T"  for  the 
flag  for  atom  1  bond  length  (Do  not  forget  to  put  in  a  bond-length  (zero 
will  do) ! ) . 

The  effect  of  these  flags  together  ‘  >  as  follows. 

1.  No  options:  All  calculated  points  will  be  printed.  No  turning 
points  will  be  calculated. 

2.  Atom  1  bond  length  flagged  with  a  "T":  If  T-PRIO,  etc.  are 
NOT  specified,  then  potential  energy  turning  points  will  be 
printed. 

3.  Internal  coordinate  flags  set  to  "T":  If  T-PRIO,  etc.  are  NOT 
specified,  then  geometry  extrema  will  be  printed.  If  only  one 
coordinate  is  flagged,  then  the  turning  point  will  be  displayed 
in  chronologic  order;  if  several  are  flagged  then  all  turning 
points  occuring  in  a  given  time-interval  will  be  printed  as 
they  are  detected.  In  other  words,  some  may  be  out  of 
chronologic  order.  Note  that  each  coordinate  flagged  will  give 
rise  to  a  different  geometry:  minimize  flagged  coordinates  to 
minimize  output. 

4.  Potential  and  geometric  flags  set:  The  effect  is  equivalent  to 
the  sum  of  the  first  two  options. 

5.  T-PRIO  set:  No  turning  points  will  be  printed,  but  constant 
time-slices  (by  default  O.lfS)  will  be  used  to  control  the 
print. 
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6.12  SPARKLES 


Four  extra  "elements"  have  been  put  into  MOPAC.  These  represent 
pure  ionic  charges,  roughly  equivalent  to  the  following  chemical 
entities: 

Chemical  Symbol  Equivalent  to 


+  Tetramethyl  ammonium  radical,  Potassium 

atom  or  Cesium  atom. 

++  Barium  atom. 

Borohydride  radical.  Halogen,  or 
Nitrate  radical 
Sulfate,  oxalate. 


For  the  purposes  of  discussion  these  entities  are  called 

"sparkles":  the  name  arises  from  consideration  of  their  behavior. 

Behavior  of  sparkles  in  MOPAC. 

Sparkles  have  the  following  properties: 

1.  Their  nuclear  charge  is  integer,  and  is  +1,  +2,  -1,  or  -2; 

there  are  an  equivalent  number  of  electrons  to  maintain 
electroneutrality,  1,  2,  -1,  and  -2  respectively.  For  example, 
a  '+'  sparkle  consists  of  a  unipositive  nucleus  and  an 

electron.  The  electron  is  donated  to  the  quantum  mechanics 
calculation. 

2.  They  all  have  an  ionic  radius  of  0.7  Angstroms.  Any  two 

sparkles  of  opposite  sign  will  form  an  ion-pair  with  a 

interatomic  separation  of  1.4A. 

3.  They  have  a  zero  heat  of  atomization,  no  orbitals,  and  no 

ionization  potential. 


They  can  be  regarded  as  unpolarizable  ions  of  diameter  1.4A.  They 
do  not  contribute  to  the  orbital  count,  and  cannot  accept  or  donate 
electrons. 

Since  they  appear  as  uncharged  species  which  immediately  ionize, 
attention  should  be  given  to  the  charge  on  the  whole  system.  For 
example,  if  the  alkaline  metal  salt  of  formic  acid  was  run,  the  formula 
would  be: 

HCOO+  where  +  is  the  unipositive  sparkle.  The  charge  on  the  system 
would  then  be  zero. 

A  water  molecule  polarized  by  a  positive  sparkle  would  have  the 
formula  H20+,  and  the  charge  on  the  system  would  be  +1 

At  first  sight,  a  sparkle  would  appear  to  be  too  ionic  to  be  a 
point  charge  and  would  combine  with  the  first  charge  of  opposite  sign  it 
encountered. 
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This  representation  is  faulty,  and  a  better  description  would  be  of 
an  ion,  of  diameter  1.4A,  and  the  charge  delocalized  over  its  surface. 
Computationally,  a  sparkle  is  an  integer  charge  at  the  center  of  a 
repulsion  sphere  of  form  exp(-alpha*r) .  The  hardness  of  the  sphere  is 
such  that  other  atoms  or  sparkles  can  approach  within  about  2  Angstroms 
quite  easily,  but  only  with  great  difficulty  come  closer  than  1.4A. 

Uses  of  Sparkles 

1.  They  can  be  used  as  counterions,  e.g.  for  acid  anions  or  for 
cations.  Thus,  if  the  ionic  form  of  an  acid  is  wanted,  then 
the  moieties  H.X,  H.-,  and  +.X  could  be  examined. 

2.  Two  sparkles  of  equal  and  opposite  sign  can  form  a  dipole  for 
mimicking  solvation  effects.  Thus  water  could  be  surrounded  by 
six  dipoles  to  simulate  the  solvent  cage.  A  dipole  of  value  D 
can  be  made  by  using  the  two  sparkles  +  and  -,  or  using  ++  and 

If  +  and  -  are  used,  the  inter-sparkle  separation  would  be 
D/4.803  Angstroms.  If  ++  and  —  are  used,  the  separation  would 
be  D/9.606  Angstroms.  If  the  inter-sparkle  separation  is  less 
than  1.0  Angstroms  (a  situation  that  cannot  occur  naturally) 
then  the  energy  due  to  the  dipole  on  its  own  is  subtracted  from 
the  total  energy. 

3.  They  can  operate  as  polarization  functions.  A  controlled, 
shaped  electric  field  can  easily  be  made  from  two  or  more 
sparkles.  The  polarizability  in  cubic  Angstroms  of  a  molecule 
in  any  particular  orientation  can  then  easily  be  calculated. 


6.13  MECHANISM  OP  THE  FRAME  IN  THE  FORCE  CALCULATION 

The  FORCE  calculation  uses  cartesian  coordinates,  and  all  3N  modes 
are  calculated,  where  N  is  the  number  of  atoms  in  the  system.  Clearly, 
there  will  be  5  or  6  "trivial"  vibrations,  which  represent  the  three 
translations  and  two  or  three  rotations.  If  the  molecule  is  exactly  at 
a  stationary  point,  then  these  "vibrations"  will  have  a  force  constant 
and  frequency  of  precisely  zero.  If  the  force  calculation  was  done 
correctly,  and  the  molecule  was  not  exactly  at  a  stationary  point,  then 
the  three  translations  should  be  exactly  zero,  but  the  rotations  would 
be  non-zero.  The  extent  to  which  the  rotations  are  non-zero  is  a 
measure  of  the  error  in  the  geometry. 

If  the  distortions  are  non-zero,  the  trivial  vibrations  can 
interact  with  the  low-lying  genuine  vibrations  or  rotations,  and  with 
the  transition  vibration  if  present. 

To  prevent  this  the  analytic  form  of  the  rotations  and  vibrations 
is  calculated,  and  arbitrary  eigenvalues  assigned;  these  are  500,  600, 
700,  800,  900,  and  1000  millidynes/angstrom  for  Tx,  Ty,  Tz,  Rx,  Ry  and 
Rz  (if  present),  respectively.  The  rotations  are  about  the  principal 
axes  of  inertia  for  the  system,  taking  into  account  isotopic  masses. 
The  "force  matrix"  for  these  trivial  vibrations  is  determined,  and  added 
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on  to  the  calculated  force  matrix.  After  diagonalization  the  arbitrary 
eigenvalues  are  subtracted  off  the  trivial  vibrations,  and  the  resulting 
numbers  are  the  "true"  values.  Interference  with  genuine  vibrations  is 
thus  avoided. 


6.14  CONFIGURATION  INTERACTION 

MOPAC  contains  a  very  large  Multi-Electron  Configuration 
Interaction  calculation,  MECI,  which  allows  almost  any  configuration 
interaction  calculation  to  be  performed.  Because  of  its  complexity,  two 
distinct  levels  of  input  are  supported;  the  default  values  will  be  of 
use  to  the  novice  while  an  expert  has  available  an  exhaustive  set  of 
keywords  from  which  a  specific  C.I.  can  be  tailored. 

A  MECI  calculation  involves  the  interaction  of  microstates 
representing  specific  permutations  of  electrons  in  a  set  of  M.O.'s. 
Starting  with  a  set  electronic  configuration,  either  closed  shell  or 
open  shell,  but  unconditionally  restricted  Hartree-Fock,  the  first  step 
in  a  MECI  calculation  is  the  removal  from  the  M.O.'s  of  the  electrons  to 
be  used  in  the  C.I. 

Each  microstate  is  then  constructed  from  these  empty  M.O.'s  by 
adding  in  electrons  according  to  a  prescription.  The  energy  of  the 
configuration  is  evaluated,  as  is  the  energy  of  interaction  with  all 
previously-defined  configurations.  Diagonalization  then  results  in 
state  functions.  From  the  eigenvectors  the  expectation  value  of  s**2  is 
calculated,  and  the  spin-states  of  the  state  functions  calculated. 


General  Overview  of  Keywords 


Keywords  associated  with  the  operations  of  MECI  are: 


SINGLET 

TRIPLET 

QUINTET 

0PEN(nl,n2) 


DOUBLET 

QUARTET 

SEXTET 

C.I.=n 


EXCITED 

BIRADICAL 

ESR 

MECI 


ROOT=n 


Each  keyword  may  imply  others;  thus  TRIPLET  implies  an  open-shell 
system,  therefore  OPEN(2,2),  and  C.I. =2  are  implied,  if  not  user 
specified. 


Starting  Electronic  Configuration 

MECI  is  restricted  to  RHF  calculations,  but  with  that  single 
restriction  any  starting  configuration  will  be  supported.  Examples  of 
starting  configurations  would  be 
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System 


Keywords  used 


Starting  Configuration 


Methane  <none> 

Methyl  Radical  <none> 

Twisted  Ethylene  TRIPLET 

Twisted  Ethylene  OPEN(2,2) 

Twisted  Ethylene  Cation  0PEN(1,2) 


2.00  2.00  2.00  2.00  2.00 
2.00  2.00  2.00  2.00  1.00 
2.00  2.00  2.00  1.00  1.00 
2.00  2.00  2.00  1.00  1.00 
2.00  2.00  2.00  0.50  0.50 


Methane  Cation 


CHARGE=1  OPEN (5,3)  2.00  2.00  1.67  1.67  1.67 


Choice  of  starting  configuration  is  important.  For  example,  if 
twisted  ethylene,  a  ground-state  triplet,  is  not  defined  using  TRIPLET 
or  OPEN (2,2),  then  the  closed-shell  ground-state  structure  will  be 
calculated.  Obviously,  this  configuration  is  a  legitimate  microstate, 
but  from  the  symmetry  of  the  system  a  better  choice  would  be  to  define 
one  electron  in  each  of  the  two  formally  degenerate  pi-type  M.O.'s.  The 
initial  SCF  calculation  does  not  distinguish  between  OPEN(2,2)  and 
TRIPLET  since  both  keywords  define  the  same  starting  configuration. 
This  can  be  verified  by  monitoring  the  convergence  using  PL,  for  which 
both  keywords  give  the  same  SCF  energy. 


Removal  of  Electrons  from  Starting  Configuration 

For  a  starting  configuration  of  alpha  M.O.  occupancies  O(i),  O(i) 
being  in  the  range  0.0  to  1.0,  the  energies  of  the  M.O.'s  involved  in 
the  MECI  can  be  calculated  from 

E ( i )  *  Sum{ j)(2J(i,j)-K(i,j))0(j) 


where  J(i,j)  and  K(i,j)  are  the  coulomb  and  exchange  integrals  between 
M.O.'s  i  and  j.  The  M.O.  index  j  runs  over  those  M.O.'s  involved  in 
the  MECI  only.  Most  MECI  calculations  will  involve  between  1  and  5 
M.O.'s,  so  a  system  with  about  30  filled  or  partly  filled  M.O.'s  could 
have  M.O.'s  25-30  involved.  The  resulting  eigenvalues  correspond  to 
those  of  the  cationic  system  resulting  from  removal  of  n  electrons, 
where  n  is  twice  the  sum  of  the  orbital  occupancies  of  those  M.O.'s 
involved  in  the  C.I. 

The  arbitrary  zero  of  energy  in  a  MECI  calculation  is  the  starting 
ground  state,  without  any  correction  for  errors  introduced  by  the  use  of 
fractional  occupancies.  In  order  to  calculate  the  energy  of  the  various 
configurations,  the  energy  of  the  vacuum  state  (i.e.,  the  state 
resulting  from  removal  of  the  electrons  used  in  the  C.I.)  needs  to  be 
evaluated.  This  energy  is  defined  by 


GSE  =  Sum( i ) [  E(i)0(i)  +  J(i,i)  *  0(i)*0(i) 

+  Sum( j<i) (  2 ( 2J( i , j )  -  K(i,j)  )  *  0(i)*0(j)  )  ) 
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Formation  of  Hicrostate  Configuration 

Microstates  are  particular  electron  configurations.  Thus  if  there 
are  5  electrons  in  5  levels,  then  various  microstates  could  be  as 
follows: 


Microstates  for  5  electrons  in  5  M.O.'s 


Electron  Configuration  Electron  Configuration 


Alpha 

Beta 

M(s) 

Alpha 

Beta 

M(s) 

1  2  3  4  5 

1  2  3  4  5 

1  2  3  4  5 

1  2  3  4  5 

1  1,1, 1,0,0 

1,1, 0,0,0 

1/2 

4 

1,1, 1,1, 

1  0,0, 0,0,0 

5/2 

2  1,1, 0,0,0 

1,1, 1,0,0 

-1/2 

5 

1,1, 0,1, 

0  1,1, 0,0,0 

1/2 

3  1,1, 1,0,0 

0,0, 0,1,1 

1/2 

6 

1,1, 0,1, 

0  1,0, 1,0,0 

1/2 

For  5  electrons  in  5 

M.O.  's 

there  are 

252  microstates 

(101/(51*5!)),  but 

:  as  states 

of  different 

spin  do  not 

mix,  we  can 

use  a 

smaller  number . 

If  doublet 

states 

are 

needed  then  100 

states 

(51/(21*3! )*(5!/3!*2! )  are  needed.  If  only  quartet  states  are  of 
interest  then  25  states  (51/(11*4! )*(51/41*1! )  are  needed  and  if  the 
sextet  state  is  required,  then  only  one  state  is  calculated. 

In  the  microstates  listed,  state  1  is  the  ground-state 
configuration.  This  can  be  written  as  (2, 2, 1,0,0),  meaning  that  M.O.'s 
1  and  2  are  doubly  occupied,  M.O.  3  is  singly  occupied  by  an  alpha 
electron,  and  M.O.'s  4  and  5  are  empty.  Microstate  1  has  a  component  of 
spin  of  1/2,  and  is  a  pure  doublet.  By  Kramer's  degeneracy  -  sometimes 
called  time-inversion  symmetry  -  microstate  2  is  also  a  doublet,  and  has 
a  spin  of  1/2  and  a  component  of  spin  of  -1/2. 

Microstate  3,  while  it  has  a  component  of  spin  of  1/2,  is  not  a 
doublet,  but  is  in  fact  a  component  of  a  doublet,  a  quartet  and  a 
sextet.  The  coefficients  of  these  states  can  be  calculated  from  the 
Clebsch-Gordon  3-J  symbol.  For  example,  the  coefficient  in  the  sextet 
is  1/Sqrt( 5) . 

Microstate  4  is  a  pure  sextet.  If  all  100  microstates  of  component 
of  spin  =  1/2  were  used  in  a  C.I.,  one  of  the  resulting  states  would 
have  the  same  energy  as  the  state  resulting  from  microstate  4. 

Microstate  5  is  an  excited  doublet,  and  microstate  6  is  an  excited 
state  of  the  system,  but  not  a  pure  spin-state. 

By  default,  if  n  M.O.'s  are  included  in  the  MECI,  then  all  possible 
microstates  which  give  rise  to  a  component  of  spin  =  0  for  even  electron 
systems,  or  1/2  for  odd  electron  systems,  will  be  used. 
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Permutations  of  Electrons  among  Molecular  Orbitals 


(0,1) 

= 

0 

(2,4)  =  1100 

(3,5)  =  11100 

(2,5)  =  11000 

1010 

11010 

10100 

(1,1) 

S 

1 

1001 

11001 

10010 

0110 

10110 

10001 

(0,2) 

S 

0 

0101 

10101 

01100 

0011 

10011 

01010 

<lr2) 

S 

10 

OHIO 

01001 

01 

(1,4)  =  1000 

01101 

00110 

0100 

01011 

00101 

(1,3) 

ss 

100 

0010 

00111 

00011 

010 

0001 

001 

(2,3) 

= 

110 

101 


Sets  of  Microstates  for  Various  MECI  Calculations 
Odd  Electron  Systems  Even  Electron  Systems 


Alpha 

Beta 

No.  of 

Alpha 

Beta 

NO.  Of 

C.I.-l 

(1,1) 

* 

(0,1) 

Conf igs. 

=  1 

(1,1) 

* 

(1,1) 

Conf igs. 
=  1 

2 

(1,2) 

* 

(0,2) 

*  2 

(1,2) 

* 

(1,2) 

=  4 

3 

(2,3) 

* 

(1,3) 

*  9 

(2,3) 

* 

(2,3) 

=  9 

4 

(2,4) 

* 

(1,4) 

*  24 

(2,4) 

* 

(2,4) 

=  36 

5 

(3,5) 

* 

(2,5) 

=  100 

(3,5) 

* 

(3,5) 

=  100 

Multi  Electron  Configuration  Interaction 


The  numbering  of  the  M.O. 's  used  in  the  MECI  is  standard/  and 
follows  the  Aufbau  principle.  The  order  of  filling  is  in  order  of 
energy,  and  alpha  before  beta.  This  point  is  critically  important  in 
deciding  the  sign  of  matrix  elements.  For  a  5  M.O.  system,  then,  the 
order  of  filling  is. 

(1)(T)(2)(2)(3)(3)(4)(4)(5)<5) 


A  triplet  state  arising  from  two  microstates,  each  with  a  component 
of  spin  =  0,  will  thus  be  the  positive  combination. 

(T)  (2)  +  <l)(I) 


This  is  in  variance  with  the  sign  convention  used  in  earlier 
programs  for  running  MNDO.  This  standard  sign  convention  was  chosen  in 
order  to  allow  the  signs  of  the  microstate  coefficients  to  conform  to 
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those  resulting  from  the  spin  step-down  operator. 

Matrix  elements  between  all  pairs  of  microstates  are  calculated  in 
order  to  form  the  secular  determinant.  Many  elements  will  be 
identically  zero,  due  to  the  interacting  determinants  differing  by  more 
than  two  M.O.'s.  For  the  remaining  interactions  the  following  types  can 
be  identified. 

1.  The  two  determinants  are  identical: 

No  permutations  are  necessary  in  order  to  calculate  the 
sign  of  the  matrix  element.  E(p,p)  is  given  simply  by 

E(p,p)=Sum(i)Oa(i,p)*[Eig(i)  +  Sum(l/2(<ii  j j>-<i j | i j>) *Oa( j ,p) 

+  <ii  jj>  *Ob( j ,p) ) 

+  Sum( i)Ob( i,p)*[Eig( i )  +  Sum{ 1/2 (<ii | j j>-<i j | i j> ) *Ob( j ,p) ) 


Oa(i,p)  =  Occupancy  of  alpha  M.O.  i  in  .'icrostate  p 
Ob(i,p)  =  Occupancy  of  beta  M.O.  i  in  Microstate  p 


2.  Determinants  differing  by  exactly  one  M.O.: 

The  differing  M.O.  can  be  of  type  alpha  or  beta.  It  is 
sufficient  to  evaluate  the  case  in  which  both  M.O.'s  are  of 
alpha  type,  the  beta  form  is  obtained  in  like  manner. 

E(p,q)  =  Sum ( k )  [  <ij|kk>  -  <ik|jk>  )  *  (Occa(k)  -  Occg(k)) 

+  <i j  J kk>  *  (Occb(k)  -  Occg(k)] 

E(p,q)  may  need  to  be  multiplied  by  -1,  if  the  number  of 

two  electron  permutations  required  to  bring  M.O.'s  i  and  j 
into  coincidence  is  odd. 


Where  Occa(k)  is  the  alpha  molecular  orbital  occupancy  in 
the  configuration  interaction. 

3.  Determinants  differing  by  exactly  two  M.O.'s: 

The  two  M.O.'s  can  have  the  same  or  opposite  spins.  Three 
cases  can  be  identified: 

1.  Both  M.O.'s  have  alpha  spin: 

For  the  first  microstate  having  M.O.'s  i  and  j,  and 
the  second  microstate  having  M.O.'s  k  and  1,  the  matrix 
element  connecting  the  two  microstates  is  given  by 

Q(p*q)  =  <ik | jl>  -  <il | jk> 

E(p,q)  may  need  to  be  multiplied  by  -1,  if  the  number  of 
two  electron  permutations  required  to  bring  M.O.  i  into 
coincidence  with  M.O.  k  and  M.O.  j  into  coincidence  with 
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M.O.  1  is  odd. 

2.  Both  M.O. ' s  have  beta  spin: 

The  matrix  element  is  calculated  in  the  same  manner  as 
in  the  previous  case. 

3.  One  M.O.  has  alpha  spin,  and  one^ beta  spin: 

For  the  first  microstate  having  M.O.'s  alpha(i)  and 
beta(j),  and  the  second  microstate  having  M.O.'s  alpha(k) 
and  beta(l),  the  matrix  element  connecting  the  two 
microstates  is  given  by 

Q(P#q)  *  <ik | jl> 

E(p,q)  may  need  to  be  multiplied  by  -1,  if  the  number  of 
two  electron  permutations  required  to  bring  M.O.  i  into 
coincidence  with  M.O.  k  and  M.O.  j  into  coincidence  with 
M.O.  1  is  odd. 


States  Arising  from  Various  Calculations 


Bach  MECI  calculation  invoked  by  use  of  the  keyword  C.I.-n  normally 
gives  rise  to  states  of  quantized  spins.  When  C.I.  is  used  without  any 
other  modifying  keywords,  the  following  states  will  be  obtained. 


.  Of  M.O. 

's  States  Arising 

States  Arising  From 

From  Odd  Electron  Systems 

Even 

Electron  Systems 

in  MECI 

Doublets 

Singl 

ats  Triplets 

1 

1 

1 

2 

2 

3 

1 

3 

8  1 

6 

3 

4 

20  4 

20 

15 

5 

75  24  1 

50 

45 

These  numbers  of  spin  states  will  be  obtained  irrespective  of  the 
chemical  nature  of  the  system. 
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Calculation  of  Spin-States 


In  order  to  calculate  the  spin-state,  the  expectation  value  of  S2 
is  calculated. 

S2  =  S(S+1)  =  Sz**2  +  2*S ( + ) S ( - ) 


=  Ne  - 

Sum(i)  [C(i,k)*C(i,k)*(l/4*(Na{ i)-Nb(i) )**2 
+  Sum(l)  Oa(l,i)*Ob(l,i) ) 

+Sum( j )  2[C(i,k)*C(j,k)* ( Kroneke r delta ( C ( i , k ) (  S(  +  )S(-)  ) C ( j  r  k ) ] ] 


Where 


Ne  =  No.  of  electrons  in  C.I. 

C(i,k)  =  Coefficient  of  Microstate  i  in  State  k 

Na(i)  =  Number  of  alpha  electrons  in  Microstate  i 

Nb(i)  =  Number  of  beta . electrons  in  Microstate  i 
Oa(l,k)  =  Occupancy  of  alpha  M.O.  1  in  Microstate  k 
Ob(l,k)  =  Occupancy  of  beta  M.O.  1  in  Microstate  k 
S(+)  =  Spin  shift  up  or  step  up  operator 

S ( — )  =  Spin  shift  down  or  step  down  operator 

The  Kronekerdelta  is  1  if  the  two  terms  in  brackets  following  it 
are  identical. 


The  spin  state  is  calculated  from  S  =  1/2  {  Sqrt(l+4*S2)  -  1  ) 

In  practice,  S  is  calculated  to  be  exactly  integer,  or  half 
integer.  That  is,  there  is  insignificant  error  due  to  approximations 
used.  This  does  not  mean,  however,  that  the  method  is  accurate.  The 
spin  calculation  is  completely  precise,  in  the  group  theoretic  sense, 
but  the  accuracy  of  the  calculation  is  limited  by  the  Hamiltonian  used, 
a  space-dependent  function. 


Choice  of  State  to  be  Optimized 

MECI  can  calculate  a  large  number  of  states  of  various  total  spin. 
Two  schemes  are  provided  to  allow  a  given  state  to  be  selected.  First, 
ROOT=n  will,  when  used  on  its  own,  select  the  n’th  state,  irrespective 
of  its  total  spin.  By  default  n=l.  If  ROOT=n  is  used  in  conjunction 
with  a  keyword  from  the  set  SINGLET,  DOUBLET,  TRIPLET,  QUARTET,  QUINTET, 
or  SEXTET,  then  the  n’th  root  of  that  spin-state  will  be  used.  For 
example,  ROOT=4  and  SINGLET  will  select  the  4th  singlet  state.  If  there 
are  two  triplet  states  below  the  fourth  singlet  state  then  this  will 
mean  that  the  sixth  state  will  be  selected. 
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Calculation  of  Unpaired  Spin  Density 

Starting  with  the  state  functions  as  linear  combinations  of 
configurations#  the  unpaired  spin  density,  corresponding  to  the  alpha 
spin  density  minus  the  beta  spin  density,  will  be  calculated  for  the 
first  few  states.  This  calculation  is  straightforward  for  diagonal 
terms,  and  only  those  terms  are  used. 


6.15  REDUCED  MASSES  IN  A  FORCE  CALCULATION 

Reduced  masses  for  a  diatomic  are  given  by 
(massl)  *  (mass2) 


(massl)  +  (mass2) 

For  a  Hydrogen  molecule  the  reduced  mass  is  thus  0.5;  for  heavily 
hydrogenated  systems,  e.g.  methane,  the  reduced  mass  can  be  very  low. 
A  vibration  involving  only  heavy  atoms  ,  e.g.  a  C-N  in  cyanide,  should 
give  a  large  reduc'J  aass. 

For  the  "tri  ;ial"  vibrations  the  reduced  mass  is  ill-defined,  and 
where  this  happens  the  reduced  mass  is  set  to  zero. 


6.16  USE  OF  SADDLE  CALCULATION 

A  SADDLE  calculation  uses  two  complete  geometries,  as  shown  on  the 
following  data  file  for  the  ethyl  radical  hydrogen  migration  from  one 
methyl  group  to  the  other. 


120 


BACKGROUND 


Page  6-41 


Line 

Line 

Line 

1 

2 

3 

UHF  SADDLE 

ETHYL  RADICAL  HYDROGEN  MIGRATION 

Line 

4 

C 

0.000000 

0 

0.000000 

0 

0.000000 

0 

0 

0 

0 

Line 

5 

C 

1.479146 

1 

0.000000 

0 

0.000000 

0 

1 

0 

0 

Line 

6 

H 

1.109475 

1 

111.328433 

1 

0.000000 

0 

2 

1 

0 

Line 

7 

H 

1.109470 

1 

111.753160 

1 

120.288410 

1 

2 

1 

3 

Line 

8 

H 

1.109843 

1 

110.103163 

1 

240.205278 

1 

2 

1 

3 

Line 

9 

H 

1.082055 

1 

121.214083 

1 

38.110989 

1 

1 

2 

3 

Line 

10 

H 

1.081797 

1 

121.521232 

1 

217.450268 

1 

1 

2 

3 

Line 

11 

0 

0.000000 

0 

0.000000 

0 

0.000000 

0 

0 

0 

0 

Line 

12 

C 

0.000000 

0 

0.000000 

0 

0.000000 

0 

0 

0 

0 

Line 

13 

C 

1.479146 

1 

0.000000 

0 

0.000000 

0 

1 

0 

0 

Line 

14 

H 

1.109475 

1 

111.328433 

1 

0.000000 

0 

2 

1 

0 

Line 

15 

H 

1.109470 

1 

111.753160 

1 

120.288410 

1 

2 

1 

3 

Line 

16 

H 

2.109843 

1 

30.103163 

1 

240.205278 

1 

2 

1 

3 

Line 

17 

H 

1.082055 

1 

121.214083 

1 

38.110989 

1 

1 

2 

3 

Line 

18 

H 

1.081797 

1 

121.521232 

1 

217.450268 

1 

1 

2 

3 

Line 

Line 

19 

20 

0 

0.000000 

0 

0.000000 

0 

0.000000 

0 

0 

0 

0 

Details  o£  the  mathematics  of  SADDLE  appeared  in  print  in  1984, 
(M.  J.  S.  Dewar,  E.  F.  Healy,  J.  J.  P.  Stewart,  J.  Chem.  Soc.  Faraday 
Trans.  II,  3,  227,  (1984))  so  only  a  superficial  description  will  be 
given  here. 


The  main  steps  in  the  saddle  calculation  are  as  follows: 


1.  The  heats  of  formation  of  both  systems  are  calculated. 

2.  A  vector  R  of  length  3N-6  defining  the  difference  between  the 
two  geometries  is  calculated. 

3.  The  scalar  P  of  the  difference  vector  is  reduced  by  some 
fraction,  normally  about  5  to  15  percent. 

4.  Identify  the  geometry  of  lower  energy;  call  this  G. 

5.  Optimize  G,  subject  to  the  constraint  that  it  maintains  a 
constant  distance  P  from  the  other  geometry. 

6.  If  the  newly-optimized  geometry  is  higher  in  energy  then  the 
other  geometry,  then  go  to  1.  If  it  is  higher,  and  the  last 
two  steps  involved  the  same  geometry  moving,  make  the  other 
geometry  G  without  modifying  P,  and  go  to  5. 

7.  Otherwise  go  back  to  2. 


The  mechanism  of  5  involves  the  coordinates  of  the  moving  geometry 
being  perturbed  by  an  amount  equal  to  the  product  of  the  discrepancy 
between  the  calculated  and  required  P  and  the  vector  R. 
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As  the  specification  of  the  geometries  is  quite  difficult,  in  that 
the  difference  vector  depends  on  angles  (which  are,  of  necessity 

ill-defined  by  360  degrees)  SADDLE  can  be  made  to  run  in  cartesian 

coordinates  using  the  keyword  XYZ.  If  this  option  is  chosen  then  the 
initial  steps  of  the  calculation  are  as  follows: 

1.  Both  geometries  are  converted  into  cartesian  coordinates. 

2.  Both  geometries  are  centered  about  the  origin  of  cartesian 

space. 

3.  One  geometry  is  rotated  until  the  difference  vector  is  a 
minimum  -  this  minimum  is  within  1  degree  of  the  absolute 
bottom. 

4.  The  SADDLE  calculation  then  proceeds  as  described  above. 
LIMITATIONS: 

The  two  geometries  must  be  related  by  a  continuous  deformation  of 
the  coordinates.  By  default,  internal  coordinates  are  used  in 
specifying  geometries,  and  while  bond  lengths  and  bond  angles  are 
unambiguously  defined  (being  both  positive),  the  dihedral  angles  can  be 
positive  or  negative.  Clearly  300  degrees  could  equally  well  be 
specified  as  -60  degrees.  A  wrong  choice  of  dihedral  would  mean  that 
instead  of  the  desired  reaction  vector  being  used,  a  completely 
incorrect  vector  was  used,  with  disastrous  results. 

To  correct  this,  ensure  that  one  geometry  can  be  obtained  from  the 
other  by  a  continuous  deformation,  or  use  the  XYZ  option. 


6.17  HOW  TO  ESCAPE  FROM  A  HILLTOP 

A  particularly  irritating  phenomenon  sometimes  occurs  when  a 
transition  state  is  being  refined.  A  rough  estimate  of  the  geometry  of 
the  transition  state  has  been  obtained  by  either  a  SADDLE  or  reaction 
path  or  by  good  guesswork.  This  geometry  is  then  refined  by  SIGMA  or  by 
NLLSQ,  and  the  system  characterized  by  a  force  calculation.  It  is  at 
this  point  that  things  often  go  wrong.  Instead  of  only  one  negative 
force  constant,  two  or  more  are  found.  In  the  past,  the  recommendation 
has  been  to  abandon  the  work  and  to  go  on  to  something  less  masochistic. 
It  is  possible,  however,  to  systematically  progress  from  a  multiple 
maximum  to  the  desired  transition  state.  The  technique  used  will  now  be 
described. 

If  a  multiple  maximum  is  identified,  most  likely  one  negative  force 
constant  corresponds  to  the  reaction  coordinate,  in  which  case  the 
objective  is  to  render  the  other  force  constants  positive.  The 
associated  normal  mode  eigenvalues  are  complex,  but  in  the  output  are 
printed  as  negative  frequencies,  and  for  the  sake  of  simplicity  will  be 
described  as  negative  vibrations.  Use  DRAW-2  to  display  the  negative 
vibrations,  and  identify  which  mode  corresponds  to  the  reaction 
coordinate.  This  is  the  one  we  need  to  retain. 
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Hitherto,  simple  motion  in  the  direction  of  the  other  modes  has 
proved  difficult.  However  the  DRC  provides  a  convenient  mechanism  for 
automatically  following  a  normal  coordinate.  Pick  the  largest  of  the 
negative  modes  to  be  annihilated,  and  run  the  DRC  along  that  mode  until 
a  minimum  is  reached.  At  that  point,  refine  the  geometry  once  more 
using  SIGMA  and  repeat  the  procedure  until  only  one  negative  mode 
exists. 

To  be  on  the  safe  side,  after  each  DRC+SIGMA  sequence  do  the 
DRC+SIGMA  operation  again,  but  use  the  negative  of  the  initial  normal 
coordinate  to  start  the  trajectory.  After  both  stationary  points  are 
reached,  choose  the  lower  point  as  the  starting  point  for  the  next 
elimination.  The  lower  point  is  chosen  because  the  transition  state 
wanted  is  the  highest  point  on  the  lowest  energy  path  connecting 
reactants  to  products.  Sometimes  the  two  points  will  have  equal  energy: 
this  is  normally  a  consequence  of  both  trajectories  leading  to  the  same 
point  or  symmetry  equivalent  points. 

After  all  spurious  negative  modes  have  been  eliminated,  the 
remaining  normal  mode  corresponds  to  the  reaction  coordinate,  and  the 
transition  state  has  been  located. 

This  technique  is  relatively  rapid,  and  relies  on  starting  from  a 
stationary  point  to  begin  each  trajectory.  If  any  other  point  is  used, 
the  trajectory  will  not  be  even  roughly  simple  harmonic.  If,  by 
mistake,  the  reaction  coordinate  is  selected,  then  the  potential  energy 
will  drop  to  that  of  either  the  reactants  or  products,  which, 
incidentally,  forms  a  handy  criterion  for  selecting  the  spurious  modes: 
if  the  potential  energy  only  drops  by  a  small  amount,  and  the  time 
evolution  is  roughly  simple  harmonic,  then  the  mode  is  one  of  the 
spurious  modes.  If  there  is  any  doubt  as  to  whether  a  minimum  is  in  the 
vicinity  of  a  stationary  point,  allow  the  trajectory  to  continue  until 
one  complete  cycle  is  executed.  At  that  point  the  geometry  should  be 
near  to  the  initial  geometry. 

Superficially,  a  line-search  might  appear  more  attractive  than  the 
relatively  expensive  DRC.  However,  a  line-search  in  cartesian  space 
will  normally  not  locate  the  minimum  in  a  mode.  An  obvious  example  is 
the  mode  corresponding  to  a  methyl  rotation. 

Keyword  Sequences  to  be  Used 

1.  To  locate  the  starting  stationary  point  given  an  approximate 
transition  state:- 

SIGMA 


2.  To  define  the  normal  modes :- 
FORCE  ISOTOPE 

At  this  point,  copy  all  the  files  to  a  second  filename,  for  use 
later. 
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3.  Given  vibrational  frequencies  of  -654,  -123,  234,  and  456, 

identify  via  DRAW-2  the  normal  coordinate  mode,  let's  say  that 
is  the  -654  mode.  Eliminate  the  second  mode  by: 

IRC=2  DRC  T=30M  RESTART  LARGE 
Use  is  made  of  the  FORCE  restart  file. 

4.  Identify  the  minimum  in  the  potential  energy  surface  by 
inspection  or  using  the  VAX  SEARCH  command,  of  form: 

SEARCH  <Filename> .OUT  % 


5.  Edit  out  of  the  output  file  the  data  file  corresponding  to  the 
lowest  point,  and  refine  the  geometry  using: 

SIGMA 


6.  Repeat  the  last  three  steps  but  for  the  negative  of  the  normal 
mode,  using  the  copied  files.  The  keywords  for  the  first  of 
the  two  jobs  are: 

IRC=-2  DRC  T=30M  RESTART  LARGE 


7.  Repeat  the  last  four  steps  as  often  as  there  are  spurious 
modes. 

8.  Finally,  carry  out  a  DRC  to  confirm  that  the  transition  state 
does,  in  fact,  connect  the  reactants  and  products.  The  drop  in 
potential  energy  should  be  monotonic.  If  you  are  unsure 
whether  this  last  operation  will  work  successfully,  do  it  at 
any  time  you  have  a  stationary  point.  If  it  fails  at  the  very 
start,  then  we  are  back  where  we  were  last  year  —  give  up  and 
go  home  J ! 


6.18  POLARIZABILITY  CALCULATION 

If  the  electrons  in  a  molecule  are  easily  moved  as  the  result  of  a 
stimulus,  then  the  molecule  is  easily  polarizable.  Thus,  if  an  applied 
electric  field  can  easily  induce  a  dipole,  then  the  polarizability  is 
large.  Any  induced  dipole  will  lower  the  energy  of  the  system,  but  this 
stabilization  might  be  masked  by  the  presence  of  a  permanent  dipole.  To 
avoid  this,  use  is  made  of  an  alternating  electric  field.  If  the 
molecule  has  an  intrinsic  dipole,  then  the  molecule  will  be  stabilized 
in  one  direction.  When  the  field  is  reversed,  the  molecule  will  be 
destabilized,  but,  on  averaging  the  two  effects,  the  result  is  a  net 
stabilization  due  only  to  the  induced  dipole. 
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Originally,  MOPAC  calculated  the  polarizability  of  molecules, 
radicals,  and  ions  by  use  of  a  shaped  electric  field. 

In  the  current  version  of  MOPAC  the  polarizability  and 
hyperpolarizability  are  calculated  by  direct  perturbation  of  the 
Hamiltonian  matrix  elements.  This  technique  was  developed  by 
Dr.  Henry  A.  Kurtz  of  Memphis  State  University  while  on  a  USAF-UES 
Summer  Faculty  Research  Program.  The  following  discussion  assumes  that 
a  homogeneous  electric  field  gradient  exists  across  the  molecule. 

The  heat  of  formation  of  the  molecule  in  this  field  is  then 
calculated.  This  quantity  can  be  expressed  as  a  series  sum. 

Heat  =  H.o.F  -  V*E{Charge)  -  dV/dx*E( Dipole) 

-  d2V/dx2*E{polar izability ) 


That  is,  the  heat  of  formation  in  the  field  is  the  sum  of  the  basic 
heat  of  formation,  less  the  electric  potential  times  any  charge,  any 
dipole  times  the  electric  field  gradient,  and  any  polarizability  times 
the  square  of  the  electric  field  gradient. 

We  are  interested  in  the  polarizability,  P. 

P  =  (2/23.061) *d**2H/dE**2 


The  second  derivative  of  H  with  respect  to  E  is  given  by 
d**2H/dE**2  =  (H(E)+H(-E)-2*H(0) )/( 2*E) , 


H ( E )  being  the  heat  of  formation  in  the  electric  field. 

The  polarizability  volume,  Vol,  is  calculated  from  the 
polarizability  by 

Vol=P/(E*4*pi*E(o) )  =  2/(E*23.061*4*pi*E(o) )  *  d**2H/dE**2 
Substituting  for  E  we  have 

Vol=2*l**4*pi*E(o)/( 23. 061*Q*Q*C*C* ( 1-1/2** ( -1/3 ) )**2)  *  d**2H/dE**2. 


It  is  a  simple  matter  to  evaluate  the  value  of  this  second-rank  tensor 
by  calculating  the  heats  of  formation  of  the  molecule  subject  to  four 
different  electric  field  gradients.  For  the  tensor  component  V(i,j), 
i=x  or  y  or  z,  j=x  or  y  or  z,  the  directions  of  the  four  different 
fields  are  defined  by. 


Field  1 
Field  2 


+  i,  +j 
-i»  -3 


Field  2 
Field  4 


+  i/  ~j 
-i,  +j. 


Thus  if  i=x  and  j=x  the  four  fields  are 
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Field 

1 

+x 

Field 

2 

0 

Field 

3 

-x 

Field 

4 

0 

Using  these  four  heats  of  formation,  in  Kcal/mole,  the 

polarizability  can  be  calculated  in  units  of  cubic  angstroms  via 


Vol  =  ( Heat ( 2 ) +Heat ( 4 ) -Heat ( 1 ) -Heat ( 3 ) ) * { 1*1*1*1 ) *2*pi*Eo 
23.061  *  (1-a)  *  (1-a)  *  Q  *  Q  *  C  *  C 

lev  is  1.60219  *  10-19  Joules 

Eo  is  8.854188  *  10-i2  Joules**(-l) .C**2.M** (-1) 

or  8.854188  *  10-22  Joules**(-l) .C**2.Angstroms**(-l) 

Vol  =  (eV  *  1**4  *  J**(-l)  *  C**2  *  M**(-l) ) 

C**2 

Vol  =  2  *  3.1415926  *  8.854188*10-22  /  (23.061  *  1.60219*10-19) 

=(Heat(2)+Heat( 4 )-Heat(l)-Heat< 3)) *0.0015056931* (1*1*1*1) 
(l-a)*(l-a)*Q*Q 


Monopolar  and  dipolar  terms  are  eliminated  in  this  treatment. 

Finally,  monatomic  additive  terms  are  included  when  MNDO  is  used. 

A  polarization  matrix  of  size  3*3  is  constructed  and 
diagonalized,  and  the  resulting  eigenvalues  are  the  calculated 
independent  polarization  volumes  in  cubic  Angstroms;  the  vectors  are  the 
independent  polarization  vectors. 


6.19  SOLID  STATE  CAPABILITY 

Currently  MOPAC  can  only  handle  up  to  one-dimensional  extended 
systems.  As  the  solid-state  method  used  is  unusual,  details  are  given 
at  this  point. 

If  a  polymer  unit  cell  is  large  enough,  then  a  single  point  in 
k-space,  the  Gamma  point,  is  sufficient  to  specify  the  entire  Brillouin 
zone.  The  secular  determinant  for  this  point  can  be  constructed  by 
adding  together  the  Fock  matrix  for  the  central  unit  cell  plus  those  for 
the  adjacent  unit  cells.  The  Born-von  Karman  cyclic  boundary  conditions 
are  satisfied,  and  diagonalization  yields  the  correct  density  matrix  for 
the  Gamma  point. 
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At  this  point  in  the  calculation,  conventionally,  the  density 
matrix  for  each  unit  cell  is  constructed.  Instead,  the  Gamma-point 
density  and  one-electron  density  matrices  are  combined  with  a 
"Gamma-point-like"  Coulomb  and  exchange  integral  strings  to  produce  a 
new  Fock  matrix.  The  calculation  can  be  visualized  as  being  done 
entirely  in  reciprocal  space,  at  the  Gamma  point. 

Most  solid-state  calculations  take  a  very  long  time.  These 
calculations,  called  "Cluster"  calculations  after  the  original 
publication,  require  between  1.3  and  2  times  the  equivalent  molecular 
calculation. 

A  minor  "fudge"  is  necessary  to  make  this  method  work.  The 
contribution  to  the  Fock  matrix  element  arising  from  the  exchange 
integral  between  an  atomic  orbital  and  its  equivalent  in  the  adjacent 
unit  cells  is  ignored.  This  is  necessitated  by  the  fact  that  the 
density  matrix  element  involved  is  invariably  large. 

The  unit  cell  must  be  large  enough  that  an  atomic  orbital  in  the 
center  of  the  unit  cell  has  an  insignificant  overlap  with  the  atomic 
orbitals  at  the  ends  of  the  unit  cell.  In  practice,  a  translation 
vector  of  more  that  about  7  or  8  Angstroms  is  sufficient.  For  one  rare 
group  of  compounds  a  larger  translation  vector  is  needed.  Polymers  with 
delocalized  pi-systems,  and  polymers  with  very  small  band-gaps  will 
require  a  larger  translation  vector,  in  order  to  accurately  sample 
k-space.  For  these  systems,  a  translation  vector  in  the  order  of  15-20 
Angstroms  is  needed. 
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PROGRAM 


The  logic  within  MOPAC  is  best  understood  by  use  of  flow-diagrams. 

There  are  two  main  sequences,  geometric  and  electronic.  These  join 
only  at  one  common  subroutine  COMPFG.  It  is  possible,  therefore,  to 
understand  the  geometric  or  electronic  sections  in  isolation,  without 
having  studied  the  other  section. 
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7.1  MAIN  GEOMETRIC  SEQUENCE 
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7.2  MAIN  ELECTRONIC  FLOW 


COMPFG  (See  GEOMETRIC  SEQUENCE) 


_ IZZ 

HCORE 


DERIV 


DC  ART 


ROTATE 
HI ELEC 


FOCK1 

FOCK2 


GMETRY 

SYMTRY 


"1-1“ 


DHC 

ITER 

RSP 

MECI 


DENSIT 

CNVG 

PULAY 


DIAT 
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7.3  CONTROL  WITHIN  MOPAC 

Almost  all  the  control  information  is  passed  via  the  single  datum 
"KEYWRD" ,  a  string  of  80  characters,  which  is  read  in  at  the  start  of 
the  job. 

Each  subroutine  is  made  independent,  as  far  as  possible,  even  at 
the  expense  of  extra  code  or  calculation.  Thus,  for  example,  the  SCF 
criterion  is  set  in  subroutine  ITER,  and  nowhere  else.  Similarly, 
subroutine  DERIV  has  exclusive  control  of  the  step  size  in  the 
finite-difference  calculation  of  the  energy  derivatives.  If  the  default 
values  are  to  be  reset,  then  the  new  value  is  supplied  in  KEYWRD,  and 
extracted  via  INDEX  and  READA.  The  flow  of  control  is  decided  by  the 
presence  of  various  keywords  in  KEYWRD. 

When  a  subroutine  is  called,  it  assumes  that  all  data  required  for 
its  operation  are  available  in  either  common  blocks  or  arguments. 
Normally  no  check  is  made  as  to  the  validity  of  the  data  received.  All 
data  are  "owned"  by  one,  and  only  one,  subroutine.  Ownership  means  the 
implied  permission  and  ability  to  change  the  data.  Thus  MOLDAT  "owns" 
the  number  of  atomic  orbitals,  in  that  it  calculates  this  number,  and 
stores  it  in  the  variable  NORBS.  Many  subroutines  use  NORBS,  but  none 
of  them  is  allowed  to  change  it.  For  obvious  reasons  no  exceptions 
should  be  made  to  this  rule.  To  illustrate  the  usefulness  of  this 
convention,  consider  the  eigenvectors,  C  and  CBETA .  These  are  owned  by 
ITER.  Before  ITER  is  called,  C  and  CBETA  are  not  calculated,  after  ITER 
has  been  called  C  and  CBETA  are  known,  so  any  subroutine  which  needs  to 
use  the  eigenvectors  can  do  so  in  the  certain  knowledge  that  they  exist. 

Any  variables  which  are  only  used  within  a  subroutine  are  not 
passed  outside  the  subroutine  unless  an  overriding  reason  exists.  This 
is  found  in  PULAY  and  CNVG,  among  others  where  arrays  used  to  hold 
spin-dependent  data  are  used,  and  these  cannot  conveniently  be  defined 
within  the  subroutines.  In  these  examples,  the  relevant  arrays  are 
"owned"  by  ITER. 

A  general  subroutine,  of  which  ITER  is  a  good  example,  handles 
three  kinds  of  data:  First,  data  which  the  subroutine  is  going  to  work 
on,  for  example  the  one  and  two  electron  matrices;  second,  data 
necessary  to  manipulate  the  first  set  of  data,  such  as  the  number  of 
atomic  orbitals;  third,  the  calculated  quantities,  here  the  electronic 
energy,  and  the  density  and  Fock  matrices. 

Reference  data  are  entered  into  a  subroutine  by  way  of  the  common 
blocks.  This  is  to  emphasize  their  peripheral  role.  Thus  the  number  of 
orbitals,  while  essential  to  ITER,  is  not  central  to  the  task  it  has  to 
perform,  and  is  passed  through  a  common  block. 

Data  the  subroutine  is  going  to  work  on  are  passed  via  the  argument 
list.  Thus  the  one  and  two  electron  matrices,  which  are  the  main  reason 
for  ITER's  existence,  are  entered  as  two  of  the  four  arguments.  As  ITER 
does  not  own  these  matrices  it  can  use  them  but  may  not  change  their 
contents.  The  other  argument  is  EE,  the  electronic  energy.  EE  is  owned 
by  ITER  even  though  it  first  appears  before  ITER  is  called. 
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Sometimes  common  block  data  should  more  correctly  appear  in  an 
argument  list.  This  is  usually  not  done  in  order  to  prevent  obscuring 
the  main  role  the  subroutine  has  to  perform.  Thus  ITER  calculates  the 
density  and  Fock  matrices,  but  these  are  not  represented  in  the  argument 
list  as  the  calling  subroutine  never  needs  to  know  them;  instead,  they 
are  stored  in  common. 

SUBROUTINE  GMETRY ;  Description  for  programmers. 

GMETRY  has  two  arguments,  GEO  and  COORD.  On  input  GEO  contains 
either  (a)  internal  coordinates  or  (b)  cartesian  coordinates.  On  exit 
COORD  contains  the  cartesian  coordinates. 

The  normal  mode  of  usage  is  to  supply  the  internal  coordinates,  in 
which  case  the  connectivity  relations  are  found  in  common  block  GEOKST. 

If  the  contents  of  NA(1)  is  zero,  as  required  for  any  normal 
system,  then  the  normal  internal  to  cartesian  conversion  is  carried  out. 

If  the  contents  of  NA(1)  is  99,  then  the  coordinates  found  in  GEO 
are  assumed  to  be  cartesian,  and  no  conversion  is  made.  This  is  the 
situation  in  a  FORCE  calculation. 

A  further  option  exists  within  the  internal  to  cartesian 
conversion.  If  STEP,  stored  in  common  block  REACTN ,  is  non-zero,  then  a 
reaction  path  is  assumed,  and  the  internal  coordinates  are  adjusted 
radially  in  order  that  the  "distance"  in  internal  coordinate  space  from 
the  geometry  specified  in  GEO  is  STEPP  away  from  the  geometry  stored  in 
GEOA,  stored  in  REACTN. 

During  the  internal  to  cartesian  conversion,  the  angle  between  the 
three  atoms  used  in  defining  a  fourth  atom  is  checked  to  ensure  that  it 
is  not  near  to  0  or  180  degrees.  If  it  is  near  to  these  angles,  then 
there  is  a  high  probability  that  a  faulty  geometry  will  be  generated  and 
to  prevent  this  the  calculation  is  stopped  and  an  error  message  printed. 

NOTE  1:  If  the  angle  is  exactly  0  or  180  degrees,  then  the 
calculation  is  not  terminated:  This  is  the  normal  situation  in  a 
high-symmetry  molecule  such  as  propyne. 

NOTE  2:  The  check  is  only  made  if  the  fourth  atom  has  a  bond  angle 
which  is  not  zero  or  180  degrees. 
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MOPAC  produces  several  hundred  messages,  all  of  which  are  intended 
to  be  self-explanatory.  However,  when  an  error  occurs  it  is  useful  to 
have  more  information  than  is  given  in  the  standard  messages. 

The  following  alphabetical  list  gives  more  complete  definitions  of 
the  messages  printed. 

AN  UNOPTIMI ZABLE  GEOMETRIC  PARAMETER.... 

When  internal  coordinates  are  supplied,  six  coordinates  cannot  be 
optimized.  These  are  the  three  coordinates  of  atom  1,  the  angle  and 
dihedral  on  atom  2  and  the  dihedral  on  atom  3.  An  attempt  has  been  made 
to  optimize  one  of  these.  This  is  usually  indicative  of  a  typographic 
error,  but  might  simply  be  an  oversight.  Either  way,  the  error  will  be 
corrected  and  the  calculation  will  not  be  stopped  here. 

ATOM  NUMBER  nn  IS  ILLDEFINED 

The  rules  for  definition  of  atom  connectivity  are: 

1.  Atom  2  must  be  connected  to  atom  1  (default  -  no  override) 

2.  Atom  3  must  be  connected  to  atom  1  or  2,  and  make  an  angle  with 
2  or  1. 

3.  All  other  atoms  must  be  defined  in  terms  of  already-defined 

atoms:  these  atoms  must  all  be  different.  Thus  atom  9  might 

be  connected  to  atom  5,  make  an  angle  with  atom  6,  and  have  a 
dihedral  with  atom  7.  If  the  dihedral  was  with  atom  5,  then 
the  geometry  definition  would  be  faulty. 


If  any  of  these  rules  is  broken,  a  fatal  error  message  is  printed, 
and  the  calculation  stopped. 
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ATOMIC  NUMBER  nn  IS  NOT  AVAILABLE  . . . 

An  element  has  been  used  for  which  parameters  are  not  available. 
Only  if  a  typographic  error  has  been  made  can  this  be  rectified.  This 
check  is  not  exhaustive,  in  that  even  if  the  elements  are  acceptable 
there  are  some  combinations  of  elements  within  MINDO/3  that  are  not 
allowed.  This  is  a  fatal  error  message. 

ATOMIC  NUMBER  OF  nn  ? 

An  atom  has  been  specified  with  a  negative  or  zero  atomic  number. 
This  is  normally  caused  by  forgetting  to  specify  an  atomic  number  or 
symbol.  This  is  a  fatal  error  message. 

ATOMS  nn  AND  nn  ARE  SEPARATED  BY  nn.nnnn  ANGSTROMS. 

Two  genuine  atoms  (not  dummies)  are  separated  by  a  very  small 
distance.  This  can  occur  when  a  complicated  geometry  is  being 
optimized,  in  which  case  the  user  may  wish  to  continue.  This  can  be 
done  by  using  the  keyword  GEO-OK.  More  often,  however,  this  message 
indicates  a  mistake,  and  the  calculation  is,  by  default,  stopped. 

ATTEMPT  TO  GO  DOWNHILL  IS  UNSUCCESSFUL. . . 

A  quite  rare  message,  produced  by  Bartel's  gradient  norm 
minimization.  Bartel's  method  attempts  to  minimize  the  gradient  norm  by 
searching  the  gradient  space  for  a  minimum.  Apparently  a  minimum  has 
been  found,  but  not  recognized  as  such.  The  program  has  searched  in  all 
(3N-6)  directions,  and  found  no  way  down,  but  the  criteria  for  a  minimum 
have  not  been  satisfied.  No  advice  is  available  for  getting  round  this 
error . 


BOTH  SYSTEMS  ARE  ON  THE  SAME  SIDE... 

A  non-fatal  message,  but  still  cause  for  concern.  During  a  SADDLE 
calculation  the  two  geometries  involved  are  on  opposite  sides  of  the 
transition  state.  This  situation  is  verified  at  every  point  by 
calculating  the  cosine  of  the  angle  between  the  two  gradient  vectors. 
For  as  long  as  it  is  negative,  then  the  two  geometries  are  on  opposite 
sides  of  the  T/S.  If,  however,  the  cosine  becomes  positive,  then  the 
assumption  is  made  that  one  moiety  has  fallen  over  the  T/S  and  is  now 
below  the  other  geometry.  That  is,  it  is  now  further  from  the  T/S  than 
the  other,  temporarily  fixed,  geometry.  To  correct  this,  identify 
geometries  corresponding  to  points  on  each  side  of  the  T/S.  (Two 
geometries  on  the  output  separated  by  the  message  "SWAPPING...")  and 
make  up  a  new  data-file  using  these  geometries.  This  corresponds  to 
points  on  the  reaction  path  near  to  the  T/S.  Run  a  new  job  using  these 
two  geometries,  but  with  BAR  set  to  a  third  or  a  quarter  of  its  original 
value,  e.g.  BAR=0.05.  This  normally  allows  the  T/S  to  be  located. 
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C.I.  NOT  ALLOWED  WITH  UHP 

There  is  no  UHF  configuration  interaction  calculation  in  MOPAC. 
Either  remove  the  keyword  that  implies  C.I.  or  the  word  UHP. 

CALCULATION  ABANDONED  AT  THIS  POINT 

A  particularly  annoying  message  I  In  order  to  define  an  atom's 
position,  the  three  atoms  used  in  the  connectivity  table  must  not 
accidentally  fall  into  a  straight  line.  This  can  happen  during  a 
geometry  optimization  or  gradient  minimization.  If  they  do,  and  if  the 
angle  made  by  the  atom  being  defined  is  not  zero  or  180  degrees,  then 
its  position  becomes  ill-defined.  This  is  not  desirable,  and  the 
calc\lation  will  stop  in  order  to  allow  corrective  action  to  be  taken. 
Note  that  if  the  three  atoms  are  in  an  exactly  straight  line,  this 
message  will  not  be  triggered.  The  good  news  is  that  the  criterion  used 
to  trigger  this  message  was  set  too  coarsely.  The  criterion  has  been 
tightened  so  that  this  message  now  does  not  often  appear.  Geometric 
integrity  does  not  appear  to  be  compromized. 

CARTESIAN  COORDINATES  READ  IN,  AND  CALCULATION... 

If  cartesian  coordinates  are  read  in,  but  the  calculation  is  to  be 
carried  out  using  internal  coordinates,  then  either  all  possible 
geometric  variables  must  be  optimized,  or  none  can  be  optimized.  If 
only  some  are  marked  for  optimization  then  ambiguity  exists.  For 
example,  if  the  "X"  coordinate  of  atom  6  is  marked  for  optimization,  but 
the  "Y"  is  not,  then  when  the  conversion  to  internal  coordinates  takes 
place,  the  first  coordinate  becomes  a  bond-length,  and  the  second  an 
angle.  These  bear  no  relationship  to  the  "XM  or  "Y"  coordinates.  This 
is  a  fatal  error. 

CARTESIAN  COORDINATES  READ  IN,  AND  SYMMETRY... 

If  cartesian  coordinates  are  read  in,  but  the  calculation  is  to  be 
carried  out  using  internal  coordinates,  then  any  symmetry  relationships 
between  the  cartesian  coordinates  will  not  be  reflected  in  the  internal 
coordinates.  For  example,  if  the  "Y"  coordinates  of  atoms  5  and  6  are 
equal,  it  does  net  follow  that  the  internal  coordinate  angles  these 
atoms  make  are  equal.  This  is  a  fatal  error. 

ELEMENT  NOT  FOUND 

When  an  external  file  is  used  to  redefine  MNDO,  AMI,  or  PM3 
parameters,  the  chemical  symbols  used  must  correspond  to  known  elements. 
Any  that  do  not  will  trigger  this  fatal  message. 
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ERROR  DURING  READ  AT  ATOM  NUMBER  .... 

Something  is  wrong  with  the  geometry  data.  In  order  to  help  find 
the  error,  the  geometry  already  read  in  is  printed.  The  error  lies 
either  on  the  last  line  of  the  geometry  printed,  or  on  the  next 
(unprinted)  line.  This  is  a  fatal  error. 

FAILED  IN  SEARCH,  SEARCH  CONTINUING 

Not  a  fatal  error.  The  Mclver-Komornicki  gradient  minimization 
involves  use  of  a  line-search  to  find  the  lowest  gradient.  This  message 
is  merely  advice.  However,  if  SIGMA  takes  a  long  time,  consider  doing 
something  else,  such  as  using  NLLSQ,  or  refining  the  geometry  a  bit 
before  resubmitting  it  to  SIGMA. 

«« - ****  FAILED  TO  ACHIEVE  SCF.  **** - »» 

The  SCF  calculation  failed  to  go  to  completion;  an  unwanted  and 
depressing  message  that  unfortunately  appears  every  so  often. 

To  date  three  unconditional  convergers  have  appeared  in  the 
literature:  the  SHIFT  technique,  Pulay's  method,  and  the  Camp-King 
converger.  It  would  not  be  fair  to  the  authors  to  condemn  their 
methods.  In  MOPAC  all  sorts  of  weird  and  wonderful  systems  are 
calculated,  systems  the  authors  of  the  convergers  never  dreamed  of. 
MOPAC  uses  a  combination  of  all  three  convergers  at  times.  Normally 
only  a  quadratic  damper  is  used. 

If  this  message  appears,  suspect  first  that  the  calculation  might 
be  faulty,  then,  if  you  feel  confident,  use  PL  to  monitor  a  single  SCF. 
Based  on  the  SCF  results  either  increase  the  number  of  allowed 
iterations,  default:  200,  or  use  PULAY,  or  Camp-King,  or  a  mixture. 

If  nothing  works,  then  consider  slackening  the  SCF  criterion.  This 
will  allow  heats  of  formation  to  be  calculated  with  reasonable 
precision,  but  the  gradients  are  likely  to  be  imprecise. 

GEOMETRY  TOO  UNSTABLE  FOR  EXTRAPOLATION. . 

In  a  reaction  path  calculation  the  initial  geometry  for  a  point  is 
calculated  by  quadratic  extrapolation  using  the  previous  three  points. 

If  a  quadratic  fit  is  likely  to  lead  to  an  inferior  geometry,  then 
the  geometry  of  the  last  point  calculated  will  be  used.  The  total 
effect  is  to  slow  down  the  calculation,  but  no  user  action  is 
recommended. 
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**  GRADIENT  IS  TOO  LARGE  TO  ALLOW. . . 

Before  a  FORCE  calculation  can  be  performed  the  gradient  norm  must 
be  so  small  that  the  third  and  higher  order  components  of  energy  in  the 
force  field  are  negligible.  If,  in  the  system  under  examination,  the 
gradient  norm  is  too  large,  the  gradient  norm  will  first  be  reduced 
using  FLEPO,  unless  LET  has  been  specified.  In  some  cases  the  FORCE 
calculation  may  be  run  only  to  decide  if  a  state  is  a  ground  state  or  a 
transition  state,  in  which  case  the  results  have  only  two 
interpretations.  Under  these  circumstances,  LET  may  be  warranted. 

GRADIENT  IS  VERY  LARGE... 

In  a  calculation  of  the  thermodynamic  properties  of  the  system,  if 
the  rotation  and  translation  vibrations  are  non-zero,  as  would  be  the 
case  if  the  gradient  norm  was  significant,  then  these  "vibrations"  would 
interfere  with  the  low-lying  genuine  vibrations.  The  criteria  for 
THERMO  are  much  more  stringent  than  for  a  vibrational  frequency 
calculation,  as  it  is  the  lowest  few  genuine  vibrations  that  determine 
the  internal  vibrational  energy,  entropy,  etc. 

ILLEGAL  ATOMIC  NUMBER 

An  element  has  been  specified  by  an  atomic  number  which  is  not  in 
the  range  1  to  107.  Check  the  data:  the  first  datum  on  one  of  the 
lines  is  faulty.  Most  likely  line  4  is  faulty. 

IMPOSSIBLE  NUMBER  OF  OPEN  SHELL  ELECTRONS 

The  keyword  OPEN(nl,n2)  has  been  used,  but  for  an  even-electron 
system  nl  was  specified  as  odd  or  for  an  odd-electron  system  nl  was 
specified  as  even.  Either  way,  there  is  a  conflict  which  the  user  must 
resolve. 

IMPOSSIBLE  OPTION  REQUESTED 

A  general  catch-all.  This  message  will  be  printed  if  two 
incompatible  options  are  used,  such  as  both  MINDO/3  and  AMI  being 
specified.  Check  the  keywords,  and  resolve  the  conflict. 

INTERNAL  COORDINATES  READ  IN,  AND  CALCULATION... 

If  internal  coordinates  are  read  in,  but  the  calculation  is  to  be 
carried  out  using  cartesian  coordinates,  then  either  all  possible 
geometric  variables  must  be  optimized,  or  none  can  be  optimized.  If 
only  some  are  marked  for  optimization,  then  ambiguity  exists.  For 
example,  if  the  bond-length  of  atom  6  is  marked  for  optimization,  but 
the  angle  is  not,  then  when  the  conversion  to  cartesian  coordinates 
takes  place,  the  first  coordinate  becomes  the  "X”  coordinate  and  the 
second  the  "Y"  coordinate.  These  bear  no  relationship  to  the  bond 
length  or  angle.  This  is  a  fatal  error. 
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INTERNAL  COORDINATES  READ  IN,  AND  SYMMETRY... 

If  internal  coordinates  are  read  in,  but  the  calculation  is  to  be 
carried  out  using  cartesian  coordinates,  then  any  symmetry  relationships 
between  the  internal  coordinates  will  not  be  reflected  in  the  cartesian 
coordinates.  For  example,  if  the  bond-lengths  of  atoms  5  and  6  are 
equal,  it  does  not  follow  that  these  atoms  have  equal  values  for  their 
"X"  coordinates.  This  is  a  fatal  error. 

JOB  STOPPED  BY  OPERATOR 

Any  MOPAC  calculation,  for  which  the  SHUTDOWN  command  works,  can  be 
stopped  by  a  user  who  issues  the  command  "$SHUT  <filename>,  from  the 
directory  which  contains  <f ilename>.DAT 

MOPAC  will  then  stop  the  calculation  at  the  first  convenient  point, 
usually  after  the  current  cycle  has  finished.  A  restart  file  will  be 
written  and  the  job  ended.  The  message  will  be  printed  as  soon  as  it  is 
detected,  which  would  be  the  next  time  the  timer  routine  is  accessed. 

****  MAX.  NUMBER  OF  ATOMS  ALLOWED: _ 

At  compile  time  the  maximum  sizes  of  the  arrays  in  MOPAC  are  fixed. 
The  system  being  run  exceeds  the  maximum  number  of  atoms  allowed.  To 
rectify  this,  modify  the  file  DIMSIZES.DAT  to  increase  the  number  of 
heavy  and  light  atoms  allowed.  If  DIMSIZES.DAT  is  altered,  then  the 
whole  of  MOPAC  should  be  re-compiled  and  re-linked. 

****  MAX.  NUMBER  OF  ORBITALS:.... 

At  compile  time  the  maximum  sizes  of  the  arrays  in  MOPAC  are  fixed. 
The  system  being  run  exceeds  the  maximum  number  of  orbitals  allowed.  To 
rectify  this,  modify  the  file  DIMSIZES.DAT  to  change  the  number  of  heavy 
and  light  atoms  allowed.  If  DIMSIZES.DAT  is  altered,  then  the  whole  of 
MOPAC  should  be  re-compiled  and  re-linked. 

****  MAX.  NUMBER  OF  TWO  ELECTRON  INTEGRALS.. 

At  compile  time  the  maximum  sizes  of  the  arrays  in  MOPAC  are  fixed. 
The  system  being  run  exceeds  the  maximum  number  of  two-electron 
integrals  allowed.  To  rectify  this,  modify  the  file  DIMSIZES.DAT  to 
modify  the  number  of  heavy  and  light  atoms  allowed.  If  DIMSIZES.DAT  is 
altered,  then  the  whole  of  MOPAC  should  be  re-compiled  and  re-linked. 

NAME  NOT  FOUND 

Various  atomic  parameters  can  be  modified  in  MOPAC  by  use  of 
EXTERNAL-.  These  comprise 


Uss 

Betas 

Gp2 

GSD 

Upp 

Betap 

Hsp 

GPD 

Udd 

Betad 

AMI 

GDD 

ZS 

Gss 

Expc 

FN1 

Zp 

Gsp 

Gaus 

FN2 

Zd 

Gpp 

Alp 

FN3 
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Thus  to  change  the  Uss  of  hydrogen  to  -13.6  the  line 
USS  H  -13.6 

could  be  used.  If  an  attempt  is  made  to  modify  any  other  parameters, 
then  an  error  message  is  printed,  and  the  calculation  terminated. 

NUMBER  OF  PARTICLES,  nn  GREATER  THAN... 

When  user-defined  microstates  are  not  used,  the  MECI  will  calculate 
all  possible  microstates  that  satisfy  the  space  and  spin  constraints 
imposed.  This  is  done  in  PERM,  which  permutes  N  electrons  in  M  levels. 
If  N  is  greater  than  M,  then  no  possible  permutation  is  valid.  This  is 
not  a  fatal  error  -  the  program  will  continue  to  run,  but  no  C.I.  will 
be  done. 


NUMBER  OF  PERMUTATIONS  TOO  GREAT,  LIMIT  60 

The  number  of  permutations  of  alpha  or  beta  microstates  is  limited 
to  60.  Thus  if  3  alpha  electrons  are  permuted  among  5  M.O.'s,  that  will 
generate  10  =  51/(31*2!)  alpha  microstates,  which  is  an  allowed  number. 
However  if  4  alpha  electrons  are  permuted  among  8  M.O.'s,  then  70  alpha 
microstates  result  and  the  arrays  defined  will  be  insufficient.  Note 
that  60  alpha  and  60  beta  microstates  will  permit  3600  microstates  in 
all,  which  should  be  more  than  sufficient  for  most  purposes.  (An 
exception  would  be  for  excited  radical  icosohedral  systems.) 

SYMMETRY  SPECIFIED,  BUT  CANNOT  BE  USED  IN  DRC 

This  is  self  explanatory.  The  DRC  requires  all  geometric 
constraints  to  be  lifted.  Any  symmetry  constraints  will  first  be 
applied,  to  symmetrize  the  geometry,  and  then  removed  to  allow  the 
calculation  to  proceed. 

SYSTEM  DOES  NOT  APPEAR  TO  BE  OPTIMI ZABLE 

This  is  a  gradient  norm  minimization  message.  These  routines  will 
only  work  if  the  nearest  minimum  to  the  supplied  geometry  in 
gradient-norm  space  is  a  transition  state  or  a  ground  state.  Gradient 
norm  space  can  be  visualized  as  the  space  of  the  scalar  of  the 
derivative  of  the  energy  space  with  respect  to  geometry.  To  a  first 
approximation,  there  are  twice  as  many  minima  in  gradient  norm  space  as 
there  are  in  energy  space. 

It  is  unlikely  that  there  exists  any  simple  way  to  refine  a 
geometry  that  results  in  this  message.  While  it  is  appreciated  that  a 
large  amount  of  effort  has  probably  already  been  expended  in  getting  to 
this  point,  users  should  steel  themselves  to  writing  off  the  whole 
geometry.  It  is  not  recommended  that  a  minor  change  be  made  to  the 
geometry  and  the  job  re-submitted. 

Try  using  SIGMA  instead  of  POWSQ. 
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TEMPERATURE  RANGE  STARTS  TOO  LOW, . . . 

The  thermodynamics  calculation  assumes  that  the  statistical 
summations  can  be  replaced  by  integrals.  This  assumption  is  only  valid 
above  100K,  so  the  lower  temperature  bound  is  set  to  100,  and  the 
calculation  continued. 

THERE  IS  A  RISK  OF  INFINITE  LOOPING... 

The  SCF  criterion  has  been  reset  by  the  user,  and  the  new  value  is 
so  small  that  the  SCF  test  may  never  be  satisfied.  This  is  a  case  of 
user  beware  1 

THIS  MESSAGE  SHOULD  NEVER  APPEAR,  CONSULT  A  PROGRAMMER! 

This  message  should  never  appear;  a  fault  has  been  introduced  into 
MOPAC,  most  probably  as  a  result  of  a  programming  error.  If  this 
message  appears  in  the  vanilla  version  of  MOPAC  (a  version  ending  in 
00),  please  contact  JJPS  as  I  would  be  most  interested  in  how  this  was 
achieved. 


THREE  ATOMS  BEING  USED  TO  DEFINE.... 

If  the  cartesian  coordinates  of  an  atom  depend  on  the  dihedral 
angle  it  makes  with  three  other  atoms,  and  those  three  atoms  fall  in  an 
almost  straight  line,  then  a  small  change  in  the  cartesian  coordinates 
of  one  of  those  three  atoms  can  cause  a  large  change  in  its  position. 
This  is  a  potential  source  of  trouble,  and  the  data  should  be  changed  to 
make  the  geometric  specification  of  the  atom  in  question  less  ambiguous. 

This  message  can  appear  at  any  time,  particularly  in  reaction  path 
and  saddle-point  calculations. 

An  exception  to  this  rule  is  if  the  three  atoms  fall  into  an 
exactly  straight  line.  For  example,  if,  in  propyne,  the  hydrogens  are 
defined  in  terms  of  the  three  carbon  atoms,  then  no  error  will  be 
flagged.  In  such  a  system  the  three  atoms  in  the  straight  line  must  not 
have  the  angle  between  them  optimized,  as  the  finite  step  in  the 
derivative  calculation  would  displace  one  atom  off  the  straight  line  and 
the  error-trap  would  take  effect. 

Correction  involves  re-defining  the  connectivity.  LET  and  GEO-OK 
will  not  allow  the  calculation  to  proceed. 

-  TIME  up  - 

The  time  defined  on  the  keywords  line  or  3,600  seconds,  if  no  time 
was  specified,  is  likely  to  be  exceeded  if  another  cycle  of  calculation 
were  to  be  performed.  A  controlled  termination  of  the  run  would  follow 
this  message.  The  job  may  terminate  earlier  than  expected:  this  is 
ordinarily  due  to  one  of  the  recently  completed  cycles  taking  unusually 
long,  and  the  safety  margin  has  been  increased  to  allow  for  the 
possibility  that  the  next  cycle  might  also  run  for  much  longer  than 
expected. 
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TRIPLET  SPECIFIED  WITH  ODD  NUMBER  OF  ELECTRONS. 

If  TRIPLET  has  been  specified  the  number  of  electrons  must  be  even. 
Check  the  charge  on  the  system,  the  empirical  formula,  and  whether 
TRIPLET  was  intended. 

. UNABLE  TO  ACHIEVE  SELF-CONSISTENCY 

See  the  error-message: 

«« - ****  FAILED  TO  ACHIEVE  SCF.  **** - »» 

UNDEFINED  SYMMETRY  FUNCTION  USED 

Symmetry  operations  are  restricted  to  those  defined,  i.e.,  in  the 
range  1-18.  Any  other  symmetry  operations  will  trip  this  fatal  message. 

UNRECOGNIZED  ELEMENT  NAME 

In  the  geometric  specification  a  chemical  symbol  which  does  not 
correspond  to  any  known  element  has  been  used.  The  error  lies  in  the 
first  datum  on  a  line  of  geometric  data. 

****  WARNING  **** 

Don't  pay  too  much  attention  to  this  message.  Thermodynamics 
calculations  require  a  higher  precision  than  vibrational  frequency 
calculations.  In  particular,  the  gradient  norm  should  be  very  small. 
However,  it  is  frequently  not  practical  to  reduce  the  gradient  norm 
further,  and  to  date  no-one  has  determined  just  how  slack  the  gradient 
criterion  can  be  before  unacceptable  errors  appear  in  the  thermodynamic 
quantities.  The  0.4  gradient  norm  is  only  a  suggestion. 

WARNING:  INTERNAL  COORDINATES... 

Triatomics  are,  by  definition,  defined  in  terms  of  internal 
coordinates.  This  warning  is  only  a  reminder.  For  diatomics,  cartesian 
and  internal  coordinates  are  the  same.  For  tetra-atomics  and  higher, 
the  presence  or  absence  of  a  connectivity  table  distinguishes  internal 
and  cartesian  coordinates,  but  for  triatomics  there  is  an  ambiguity.  To 
resolve  this,  cartesian  coordinates  are  not  allowed  for  the  data  input 
for  triatomics. 


CHAPTER  9 
CRITERIA 


MOPAC  uses  various  criteria  which  control  the  precision  of  its 
stages.  These  criteria  are  chosen  as  the  best  compromise  between  speed 
and  acceptable  errors  in  the  results.  The  user  can  override  the  default 
settings  by  use  of  keywords;  however,  care  should  be  exercised  as 
increasing  a  criterion  can  introduce  the  potential  for  infinite  loops, 
and  decreasing  a  criterion  can  result  in  unacceptably  imprecise  results. 
These  are  usually  characterized  by  'noise'  in  a  reaction  path,  or  large 
values  for  the  trivial  vibrations  in  a  force  calculation. 


9 . 1  SCF  CRITERION 


Name:  SCFCRT. 

Defined  in  ITER. 

]  Default  value  0.0001  kcal/mole 

Basic  Test  Change  in  energy  in  kcal/mole  on  successive 

iterations  3  less  than  SCFCRT. 

Exceptions:  If  PRECISE  is  specified,  SCFCRT=0. 000001 

If  a  polarization  calculation  SCFCRT=1 .D-ll 

If  a  FORCE  calculation  SCFCRT=0. 0000001 

If  SCFCRT=n.nnn  is  specified  SCFCRT=n.nnn 

If  a  BFGS  optimization,  SCFCRT  becomes  a  function 
of  the  difference  between  the  current  energy  and 
the  lowest  energy  of  previous  SCFs. 

Secondary  tests:  (1)  Change  in  density  matrix  elements  on  two 

successive  iterations  must  be  less  than  0.001 
(2)  Change  in  energy  in  eV  on  three  successive 
iterations  must  be  less  than  10  x  SCFCRT. 
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9.2  GEOMETRIC  OPTIMIZATION  CRITERIA 


Name: 

Defined  in 
Default  value 
Basic  Test 


Exceptions 
Name : 

Defined  in 
Default  value 
Basic  Test 


Exceptions 

Name: 

Defined  in 
Default  value 
Basic  Test 


Exceptions 


Name: 

Defined  in 
Default  value 
Basic  Test 


Exceptions 


TOLERX  "Test  on  X  Satisfied" 

FLEPO 

0.0001  Angstroms 

The  projected  change  in  geometry  is  less  than 
TOLERX  Angstroms. 

If  GNORM  is  specified,  the  TOLERX  test  is  not  used. 

DELHOF  "Herbert's  Test  Satisfied" 

FLEPO 

0.001 

The  projected  decrease  in  energy  is  less  than 
DELHOF  Kcals/mole. 

If  GNORM  is  specified,  the  DELHOF  test  is  not  used. 

TOLERG  "Test  on  Gradient  Satisfied" 

FLEPO 

1.0 

The  gradient  norm  in  Kcals/mole/Angstrom  is  less 
than  TOLERG  multiplied  by  the  square  root  of  the 
number  of  coordinates  to  be  optimized. 

If  GNORM=n.nnn  is  specified,  TOLERG=n . nnn  divided 
by  the  square  root  of  the  number  of  coordinates 
to  be  optimized,  and  the  secondary  tests  are  not 
done.  If  LET  is  not  specified,  n.nnn  is  reset  to 
0.01,  if  it  was  smaller  than  0.01. 

If  PRECISE  is  specified,  TOLERG=0.2 

If  a  SADDLE  calculation,  TOLERG  is  made  a  function 
of  the  last  gradient  norm. 

TOLERF  "Heat  of  Formation  Test  Satisfied" 

FLEPO 

0.002  Kcal/mole 

The  calculated  heats  of  formation  on  two  successive 
cycles  differ  by  less  than  TOLERF. 

If  GNORM  is  specified,  the  TOLERF  test  is  not  used. 


Secondary  Tests  For  the  TOLERG,  TOLERF,  and  TOLERX  tests,  a 

second  test  in  which  no  individual  component  of  the 
gradient  should  be  larger  than  TOLERG  must  be 
satisfied. 


Other  Tests 


Exceptions 

Name: 


If,  after  the  TOLERG,  TOLERF,  or  TOLERX  test  has  been 
satisfied  three  consecutive  times  the  heat  of 
formation  has  dropped  by  less  than  0.3Kcal/mole,  then 
the  optimization  is  stopped. 

If  GNORM  is  specified,  then  this  test  is  not  performed. 
TOL2 
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Defined  in 
Default  value 
Basic  Test 


Exceptions 


Name: 

Defined  in 
Default  Value 
Basic  Test 


Name: 

Defined  in 
Default  Value 
Basic  Test 


POWSQ 

0.4 

The  absolute  value  of  the  largest  component  of  the 
gradient  is  less  than  TOL2 

If  PRECISE  is  specified,  TOL2-0.01 
If  GNORM=n . nn  is  specified,  TOL2=n.nn 
If  LET  is  not  specified,  TOL2  is  reset  to 
0.01,  if  n.nn  was  smaller  than  0.01. 

TOLS1 

NLLSQ 

0.000  000  000  001 

The  square  of  the  ratio  of  the  projected  change  in  the 
geometry  to  the  actual  geometry  is  less  than  TOLS1. 

<none> 

NLLSQ 

0.2 

Every  component  of  the  gradient  is  less  than  0.2. 


CHAPTER  10 


DEBUGGING 


There  are  three  potential  sources  of  difficulty  in  using  MOPAC, 
each  of  which  requires  special  attention.  There  can  be  problems  with 
data,  due  to  errors  in  the  data,  or  MOPAC  may  be  called  upon  to  do 
calculations  for  which  it  was  not  designed.  There  are  intrinsic  errors 
in  MOPAC  which  extensive  testing  has  not  yet  revealed,  but  which  a 
user's  novel  calculation  uncovers.  Finally  there  can  be  bugs  introduced 
by  the  user  modifying  MOPAC,  either  to  make  it  compatible  with  the  host 
computer,  or  to  implement  local  features. 

For  whatever  reason,  the  user  may  need  to  have  access  to  more 
information  than  the  normal  keywords  can  provide,  and  a  second  set, 
specifically  for  debugging,  is  provided.  These  keywords  give 
information  about  the  working  of  individual  subroutines,  and  do  not 
affect  the  course  of  the  calculation. 


10.1  DEBUGGING  KEYWORDS 

FULL  LIST  OF  KEYWORDS  FOR  DEBUGGING  SUBROUTINES 

INFORMATION  PRINTED 


1ELEC 
COMPFG 
DC  ART 
DEBUG 
DEBUGPULAY 
DENSITY 
DERI1 
DERI  2 
DERITR 
DERIV 
|  DERNVO 
D FORCE 
|  DIIS 
EIGS 
FLEPO 
FMAT 
FOCK 


the  one-electron  matrix. 

Heat  of  Formation. 

Cartesian  derivatives. 

Pulay  matrix,  vector,  and  error-function. 
Every  density  matrix. 

Details  of  DERI1  calculation 

Details  of  DERI 2  calculation 

Details  of  DERITR  calculation 

All  gradients,  and  other  data  in  DERIV. 

Details  of  DERNVO  calculation 

Print  Force  Matrix. 

Details  of  DIIS  calculation 
All  eigenvalues. 

Details  of  BFGS  minimization. 

Every  Fock  matrix 


Note  1 


Note  2 
Note  3 
Note  1 


Note  1 
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HCORE 

ITEB 

LARGE 

LINMIN 

MOLDAT 

MECI 

PL 

LINMIN 

TIMES 

VECTORS 


The  one  electron  matrix,  and  two  electron  integrals. 
Values  of  variables  and  constants  in  ITER. 

Increases  amount  of  output  generated  by  other  keywords. 
Details  of  line  minimization  (LINMIN,  LOCMIN,  SEARCH) 
Molecular  data,  number  of  orbitals,  "UN  values,  etc. 

C.I.  matrices,  M.O.  indices,  e.tc. 

Differences  between  density  matrix  elements  Note  4 

in  ITER. 

Function  values,  step  sizes  at  all  points  in  the 
line  minimization  (LINMIN  or  SEARCH). 

Times  of  stages  within  ITER. 

All  eigenvectors  on  every  iteration.  Note  1 


NOTES 

1.  These  keywords  are  activated  by  the  keyword  DEBUG.  Thus  if 
DEBUG  and  FOCK  are  both  specified,  every  Fock  matrix  on  every 
iteration  will  be  printed. 

2.  DEBUG  is  not  intended  to  increase  the  output,  but  does  allow 
other  keywords  to  have  a  special  meaning. 

3.  PULAY  is  already  a  keyword,  so  DEBUGPULAY  was  an  obvious 
alternative. 

4.  PL  initiates  the  output  of  the  value  of  the  largest  difference 
between  any  two  density  matrix  elements  on  two  consecutive 
iterations.  This  is  very  useful  when  investigating  options  for 
increasing  the  rate  of  convergence  of  the  SCF  calculation. 


SUGGESTED  PROCEDURE  FOR  LOCATING  BUGS 

Users  are  supplied  with  the  source  code  for  MOPAC,  and,  while  the 
original  code  is  fairly  bug-free,  after  it  has  been  modified  there  is  a 
possibility  that  bugs  may  have  been  introduced.  In  these  circumstances 
the  author  of  the  changes  is  obviously  responsible  for  removing  the 
offending  bug,  and  the  following  ideas  might  prove  useful  in  this 
context. 

First  of  all,  and  most  important,  before  any  modifications  are  done 
a  back-up  copy  of  the  standard  MOPAC  should  be  made.  This  will  prove 
invaluable  in  pinpointing  deviations  from  the  standard  working.  This 
point  cannot  be  over -emphasized  -  MAKE  A  BACK-UP  BEFORE  MODIFYING 
MOPAC 1  I  I ! 

Clearly,  a  bug  can  occur  almost  anywhere,  and  a  logical  search 
sequence  is  necessary  in  order  to  minimize  the  time  taken  to  locate  it. 

If  possible,  perform  the  debugging  with  a  small  molecule,  in  order 
to  save  time  (debugging  is,  of  necessity,  time  consuming)  and  to 
minimize  output. 
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The  two  sets  of  subroutines  in  MOPAC,  those  involved  with  the 
electronics  and  those  involved  in  the  geometries,  are  kept  strictly 
separate,  so  the  first  question  to  be  answered  is  which  set  contains  the 
bug.  If  the  heats  of  formation,  derivatives,  I.P.s,  and  charges,  etc., 
are  correct,  the  bug  lies  in  the  geometries?  if  faulty,  in  the 
electronics. 

Bug  in  the  Electronics  Subroutines. 

Use  formaldehyde  for  this  test.  The  supplied  data-file  MNRSDl.DAT 
could  be  used  as  a  template  for  this  operation.  Use  keywords  1SCF, 
DEBUG,  and  any  others  necessary. 

The  main  steps  are: 

(1)  Check  the  starting  one-electron  matrix  and  two-electron 
integral  string,  using  the  keyword  HCORE .  It  is  normally  sufficient  to 
verify  that  the  two  hydrogen  atoms  are  equivalent,  and  that  the  pi 
system  involves  only  pz  on  oxygen  and  carbon.  Note  that  numerical 
values  are  not  checked,  but  only  relative  values. 

If  an  error  is  found,  use  MOLDAT  to  verify  the  orbital  character, 

etc. 

If  faulty  the  error  lies  in  READ,  GETGEO  or  MOLDAT. 

Otherwise  the  error  lies  in  HCORE,  HI ELEC  or  ROTATE. 

If  the  starting  matrices  are  correct,  go  on  to  step  (2). 

(2)  Check  the  density  or  Fock  matrix  on  every  iteration,  with  the 
words  FOCK  or  DENSITY.  Check  the  equivalence  of  the  two  hydrogen  atoms, 
and  the  pi  system,  as  in  (1). 

If  an  error  is  found,  check  the  first  Fock  matrix.  If  faulty,  the 
bug  lies  in  ITER,  probably  in  the  Fock  subroutines  FOCK1  or  FOCK2 .  or 
in  the  (guessed)  density  matrix  (MOLDAT).  An  exception  is  in  the  UHF 
closed-shell  calculation,  where  a  small  asymmetry  is  introduced  to 
initiate  the  separation  of  the  alpha  and  beta  UHF  wavefunctions. 

If  no  error  is  found,  check  the  second  Fock  matrix.  If  faulty,  the 
error  lies  in  the  density  matrix  DENSIT,  or  the  diagonalization  RSP. 

If  the  Fock  matrix  is  acceptable,  check  all  the  Fock  matrices.  If 
the  error  starts  in  iterations  2  to  4,  the  error  probably  lies  in  CNVG, 
if  after  that,  in  PULAY,  if  used. 

If  SCF  is  achieved,  and  the  heat  of  formation  is  faulty,  check 
HELECT.  If  C.I.  was  used  check  MECI. 

If  the  derivatives  are  faulty,  use  DCART  to  verify  the  cartesian 
derivatives.  If  these  are  faulty,  check  DCART  and  DHC.  If  they  are 
correct,  or  not  calculated,  check  the  DERIV  finite  difference 
calculation.  If  the  wavefunction  is  non-variationally  optimized,  check 
DERNVO . 
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If  the  geometric  calculation  is  faulty,  use  FLEPO  to  monitor  the 
optimization,  DER1V  may  also  be  useful  here. 

For  the  FORCE  calculation,  DCART  or  DERIV  are  useful  for 
variationally  optimized  functions,  COMPFG  for  non-variationally 
optimized  functions. 

For  reaction  paths,  verify  that  FLEPO  is  working  correctly;  if  so, 
then  PATHS  is  faulty. 

For  saddle-point  calculations,  verify  that  FLEPO  is  working 
correctly;  if  so,  then  REACT 1  is  faulty. 

Keep  in  mind  the  fact  that  MOPAC  is  a  large  calculation,  and  while 
intended  to  be  versatile,  many  combinations  of  options  have  not  been 
tested.  If  a  bug  is  found  in  the  original  code,  please  communicate 
details  to  the  Academy,  to  Dr.  James  J.  P.  Stewart,  Frank  J.  Seiler 
Research  Laboratory,  U.S.  Air  Force  Academy,  Colorado  Springs,  CO 
80840-6528. 
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MOPAC  is  distributed  on  a  magnetic  tape  as  a  set  of  FORTRAN-77 
files,  along  with  ancillary  documents  such  as  command,  help,  data  and 
results  files.  The  format  of  the  tape  is  that  of  DIGITAL'S  VAX 
computers.  The  following  instructions  apply  only  to  users  with  VAX 
computers:  users  with  other  machines  should  use  the  following 
instructions  as  a  guide  to  getting  MOPAC  up  and  running. 

1.  Put  the  magnetic  tape  on  the  tape  drive,  write  protected. 

2.  Allocate  the  tape  drive  with  a  command  such  as  $ ALLOCATE  MTAO : 

3.  Go  into  an  empty  directory  which  is  to  hold  MOPAC 

4.  Mount  the  magnetic  tape  with  the  command  $MOUNT  MTAO:  MOPAC 

5.  Copy  all  the  files  from  the  tape  with  the  command 

$COPY  MTAO:*.*  * 


A  useful  operation  after  this  would  be  to  make  a  hard  copy  of  the 
directory.  You  should  now  have  the  following  sets  of  files  in  the 
directory: 

1.  A  file,  AAAINVOICE.TXT,  summarizing  this  list. 

2.  A  set  of  FORTRAN-77  files,  see  Appendix  1. 

3.  The  command  files  COMPILE,  MOPACCOM,  MOPAC,  RMOPAC,  and  SHUT. 

4.  A  file,  MOPAC. OPT,  which  lists  all  the  object  modules  used  by 
MOPAC. 

5.  Help  files  MOPAC. HLP  and  HELP. FOR 

6.  A  text  file  MOPAC. MAN. 

7.  A  manual  summarizing  the  updates,  called  UPDATE. MAN. 
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8.  Two  test-data  files:  TESTDATA.DAT  and  MNRSD1 . DAT ,  and 

corresponding  results  files,  TESTDATA.OUT  and  MNRSDl.OUT. 
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STRUCTURE  OF  COMMAND  FILES 
COMPILE 

The  parameter  file  DIMSIZES.DAT  should  be  read  and/  if  necessary, 
dified  before  COMPILE  is  run. 

DO  NOT  RUN  COMPILE  AT  THIS  TIME! I 

COMPILE  should  be  run  once  only.  It  assigns  DIMSIZES.DAT,  the 
ock  of  FORTRAN  which  contains  the  PARAMETERS  for  the  dimension  sizes 
the  logical  name  "SIZES".  This  is  a  temporary  assignment,  but  the 
er  is  strongly  recommended  to  make  it  permanent  by  suitably  modifying 
GIN  file(s).  COMPILE  is  a  modified  version  of  Maj  Donn  Storch's 
MPILE  for  DRAW-2. 

All  the  FORTRAN  files  are  then  compiled,  using  the  array  sizes 
ven  in  DIMSIZES.DAT:  these  should  be  modified  before  COMPILE  is  run. 
,  for  whatever  reason,  DIMSIZES.DAT  needs  to  be  changed,  then  COMPILE 
ould  be  re-run,  as  modules  compiled  with  different  DIMSIZES.DAT  will 
incompatible. 

The  parameters  within  DIMSIZES.DAT  that  the  user  can  modify  are 
tXLIT,  MAXHEV,  MAXTIM  and  MAXDMP.  MAXLIT  is  assigned  a  value  equal  to 
ie  largest  number  of  hydrogen  atoms  that  a  MOPAC  job  is  expected  to 
in,  MAXHEV  is  assigned  the  corresponding  number  of  heavy  (non-hydrogen) 
oms.  The  ratio  of  light  to  heavy  atoms  should  not  be  less  than  1/2. 

>  not  set  MAXHEV  or  MAXLIT  less  than  7.  If  you  do,  some  subroutines 
.11  not  compile  correctly.  Some  molecular  orbital  eigenvector  arrays 
e  overlapped  with  Hessian  arrays,  and  to  prevent  compilation  time 
ror  messages,  the  number  of  allowed  A.O.'s  must  be  greater  than,  or 
[ual  to  three  times  the  number  of  allowed  real  atoms.  MAXTIM  is  the 
(fault  maximum  time  in  seconds  a  job  is  allowed  to  run  before  either 
impletion  or  a  restart  file  being  written.  MAXDMP  is  the  default  time 
i  seconds  for  the  automatic  writing  of  the  restart  files.  If  your 
unputer  is  very  reliable,  and  disk  space  is  at  a  premium,  you  might 
int  to  set  MAXDMP  as  MAXDMP=999999 . 

If  SYBYL  output  is  wanted,  set  ISYBYL  to  1,  otherwise  set  it  to 

fro. 


If  you  want,  NMECI  can  be  changed.  Setting  it  to  1  will  save  some 
face,  but  will  prevent  all  C.I.  calculations  except  simple  radicals. 

If  you  want,  NPULAY  can  be  set  to  1.  This  saves  memory,  but  also 
sables  the  PULAY  converger. 

If  you  want,  MESP  can  be  varied.  This  is  only  meaningful  if  ESP  is 
(stalled. 

Compile  MOPAC.  This  operation  takes  about  7  minutes,  and  should  be 
in  "on-line",  as  a  question  and  answer  session  is  involved. 

When  everything  is  successfully  compiled,  the  object  files  will 
ten  be  assembled  into  an  executable  image  called  MOPAC . EXE .  Once  the 
tage  exists,  there  is  no  reason  to  keep  the  object  files,  and  if  space 
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is  at  a  premium  these  can  be  deleted  at  this  time. 

I£  you  need  to  make  any  changes  to  any  of  the  files,  COMPILE 
followed  by  the  names  of  the  changed  files  will  reconstruct  MOPAC, 
provided  all  the  other  OBJ  files  exist.  For  example,  if  you  change  the 
version  number  in  DIMSIZES.DAT,  then  READ. FOR  and  WRITE. FOR  are  affected 
and  will  need  to  be  recompiled.  This  can  be  done  using  the  command 

6C0MPILE  WRITE, READ 

In  the  unlikely  event  that  you  want  to  link  only,  use  the  command 
6C0MPILE  LINK 


Sometimes  the  link  stage  will  fail,  and  give  the  message 

"%LINK-E-INSVIRMEM,  insufficient  virtual  memory  for  2614711.  pages 
-LINK-E-NOIMGFIL ,  image  file  not  created", 

or  your  MOPAC  will  not  run  due  to  the  size  of  the  image.  In  these  cases 
you  should  ask  the  system  manager  to  alter  your  PGFLQUO  and  WSEXTENT 
limits.  Possibly  the  system  limits,  VIRTUALPAGECNT  CURRENT  and  MAX  will 
need  to  be  changed.  As  an  example,  on  a  Microvax  3600  with  16Mb  of 
memory,  PGFLQUO=50000,  WSEXTENT=16000,  VIRTUALPAGECNT  CURRENTS 07 68, 
VIRTUALPAGECNT  MAX=600000  are  sufficient  for  the  default  MOPAC  values  of 
43  heavy  and  43  light  atoms. 

In  order  for  users  to  have  access  to  MOPAC  they  must  insert  in 
their  individual  LOGlN.COM  files  the  line 

$8  <Mopac-directory>MOPACCOM 

where  <Mopac-directory>  is  the  name  of  the  disk  and  directory  which 
holds  all  the  MOPAC  files.  For  example,  DRAOs [MOPAC] ,  thus 

$@  DRAO: [MOPAC] MOPAC 

MOPACCOM.COM  should  be  modified  once  to  accommodate  local 
definitions  of  the  directory  which  is  to  hold  MOPAC.  This  change  must 
also  be  made  to  RMOPAC.COM  and  to  MOPAC.COM. 

MOPAC 

This  command  file  submits  a  MOPAC  job  to  a  queue.  Before  use, 
MOPAC.COM  should  be  modified  to  suit  local  conditions.  The  user's  VAX 
is  assumed  to  run  three  queues,  called  QUEUE 3,  QUEUE2 ,  and  QUEUE1.  The 
user  should  substitute  the  actual  names  of  the  VAX  queues  for  these 
symbolic  names.  Thus,  for  example,  if  the  local  names  of  the  queues  are 
"TWELVEHOUR" ,  for  jobs  of  length  up  to  12  hours,  "ONEHOUR",  for  jobs  of 
less  than  one  hour,  and  M30MINS”  for  quick  jobs,  then  in  place  of 
"0UEUE3",  "QUEUE2" ,  and  "QUEUEl"  the  words  "TWELVEHOUR",  "ONEHOUR",  and 
W30MINSN  should  be  inserted. 
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RMOPAC 

RMOPAC  is  the  command  file  for  running  MOPAC.  It  assigns  all  the 
data  files  that  MOPAC  uses  to  the  channels.  If  the  user  wants  to  use 
other  file-name  endings  than  those  supplied,  the  modifications  should  be 
made  to  RMOPAC. 

When  a  long  job  ends,  RMOPAC  will  also  send  a  mail  message  to  the 
user  giving  a  brief  description  of  the  job.  You  may  want  to  change  the 
default  definition  of  "a  long  job";  currently  it  is  12  hours.  This 
feature  was  written  by  Dr.  James  Petts  of  Kodak  Ltd  Research  Labs. 

A  recommended  sequence  of  operations  to  get  MOPAC  up  and  running 
would  be: 


1.  Modify  the  file  DIMSIZES.DAT.  The  default  sizes  are  40  heavy 
atoms  and  40  light  atoms.  Do  not  make  the  size  less  than  7  by 
7. 

2.  Read  through  the  COMMAND  files  to  familiarize  yourself  with 
what  is  being  done. 

3.  Edit  the  file  MOPAC.COM  to  use  the  local  queue  names. 

4.  Edit  the  file  RMOPAC.COM  if  the  default  file-names  are  not 
acceptable. 

5.  Edit  MOPACCOM.COM  to  assign  MOPACDI RECTORY  to  the  disk  and 
directory  which  will  hold  MOPAC. 

6.  Edit  the  individual  LOGIN.COM  files  to  insert  the  following 
line 

$@  <Mopac-di rector y>M0PACC0M 

Note  that  MOPACDIRECTORY  cannot  be  used,  as  the  definition 
Of  MOPACDIRECTORY  is  made  in  MOPACCOM.COM 

7.  Execute  the  modified  LOGIN  command  so  that  the  new  commands  are 
effective. 

8.  Run  COMPILE.COM.  This  takes  about  8  minutes  to  execute. 

9.  Enter  the  command 
$M0PAC 

You  will  receive  the  message 
"What  file?  :" 

to  which  the  reply  should  be  the  actual  data-file  name.  For 
example,  "MNRSD1",  the  file  is  assumed  to  end  in  .DAT, 
e.g.  MNRSD1.DAT. 

You  will  then  be  prompted  for  the  queue: 

"What  queue?  : " 

Any  queue  defined  in  MOPAC.COM  will  suffice: 

"SYS$ BATCH" 

Finally,  the  priority  will  be  requested: 
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"What  priority?  [5]:" 

To  which  any  value  between  1  and  5  will  suffice.  Note  that  the 
maximum  priority  is  limited  by  the  system  (manager). 


11.1  ESP  CALCULATION. 

As  supplied,  MOPAC  will  not  do  the  ESP  calculation  because  of  the 
large  memory  requirement  of  the  ESP.  To  install  the  BSP,  make  the 
following  changes: 

1.  Rename  ESP.ROF  to  ESP. FOR 

2.  Add  to  the  first  line  of  MOPAC. OPT  the  string  "  ESP,  "  (without 
the  quotation  marks). 

3.  Edit  MNDO.FOR  to  uncomment  the  line  "C#  CALL  ESP". 

4.  Compile  ESP  and  MNDO,  and  relink  MOPAC  using,  e.g.  COMPILE 
ESP,MNDO" . 

5.  If  the  resulting  executable  is  too  large,  modify  DIMSIZES.DAT 
to  reduce  MAXHEV  and  MAXLIT,  then  recompile  everything  and 
relink  MOPAC  with  "^COMPILE" . 


To  familiarize  yourself  with  the  system,  the  following  operations 
might  b<  jseful. 

1.  Run  the  (supplied)  test  molecules,  and  verify  that  MOPAC  is 
producing  "acceptable"  results. 

2.  Make  some  simple  modifications  to  the  datafiles  supplied  in 
order  to  test  your  understanding  of  the  data  format 

3.  When  satisfied  that  MOPAC  is  working,  and  that  data  files  can 
be  made,  begin  production  runs. 


Working  of  SHUTDOWN  command 

If,  for  whatever  reason,  a  run  needs  to  be  stopped  prematurely,  the 
command  $SHUT  <jobname>  can  be  issued.  This  will  execute  a  small 
command- language  file,  which  copies  the  data-file  to  form  a  new  file 
called  <f ilename>.END 

The  next  time  MOPAC  calls  function  SECOND,  the  presence  of  a 
readable  file  called  SHUTDOWN,  logically  identified  with  <f ilename>.END, 
is  checked  for,  and  if  it  exists,  the  apparent  elapsed  CPU  time  is 
increased  by  1,000,000  seconds,  and  a  warning  message  issued.  No 
further  action  is  taken  until  the  elapsed  time  is  checked  to  see  if 
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enough  time  remains  to  do  another  cycle.  Since  an  apparently  very  long 
time  has  been  used,  there  is  not  enough  time  left  to  do  another  cycle, 
and  the  restart  files  are  generated  and  the  run  stopped. 

SHUTDOWN  is  completely  machine  -  independent. 

Specific  instructions  for  mounting  MOPAC  on  other  computers  have 
been  left  out  due  to  limitations  of  space  in  the  Manual;  however,  the 
following  points  may  prove  useful: 

1.  Function  SECOND  is  machine-specific.  SECOND  is 
double-precision,  and  should  return  the  CPU  time  in  seconds, 
from  an  arbitary  zero  of  time.  If  the  SHUT  command  has  been 
issued,  the  value  returned  by  SECOND  should  be  increased  by 
1,000,000. 

2.  On  UNIX-based  and  other  machines,  on-line  help  can  be  provided 
by  using  help.f.  Documentation  on  help.f  is  in  help.f. 

3.  OPEN  and  CLOSE  statements  are  a  fruitful  source  of  problems. 
If  MOPAC  does  not  work,  most  likely  the  trouble  lies  in  these 
statements. 

4.  RMOPAC.COM  should  be  read  to  see  what  files  are  attached  to 
what  logical  channel. 

How  to  use  MOPAC 

The  COM  file  to  run  the  MOPAC  can  be  accessed  using  the  command 
"MOPAC"  followed  by  none,  one,  two  or  three  arguments.  Possible  options 
are: 

MOPAC  MYDATAFILE  120  4 

MOPAC  MYDATAFILE  120 

MOPAC  MYDATAFILE 

In  the  latter  case  it  is  assumed  that  the  shortest  queue  will  be 
adequate.  The  COM  file  to  run  the  MOPAC  can  be  accessed  using  the 
command  "MOPAC"  followed  by  none,  one  or  two  arguments.  Possible 
options  are: 

MOPAC  MYDATAFILE  120 
MOPAC  MYDATAFILE 

In  the  latter  case  it  is  assumed  that  the  default  time  (15  seconds)  will 
be  adequate. 

MOPAC 


In  this  case  you  will  be  prompted  for  the  datafile,  and  then  for 
the  queue.  Restarts  should  be  user  transparent.  If  MOPAC  does  make  any 
restart  files,  do  not  change  them  (It  would  be  hard  to  do  anyhow,  as 
they're  in  machine  code),  as  they  will  be  used  when  you  run  a  RESTART 
job.  The  files  used  by  MOPAC  are: 
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File 


Description 


<£ i lename> . DAT 
<f ilename>.OUT 
<f ilename>.RES 
<f ilename>.DEN 
SYS$OUTPUT 
<filename>.ARC 
<filename>.GPT 
<filename>.SYB 
SETUP . DAT 


Data 

Results 

Restart 

Density  matrix  (in  binary) 
LOG  file 

Archive  or  summary 
Data  for  program  DENSITY 
SYBYL  data 
SETUP  data 


Logical  name 

'FOR005 

FOR006 

FOR009 

FOROIO 

FOR011 

FOR012 

FOR013 

FOR016 

SETUP 
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SHORT  VERSION 

For  various  reasons  it  might  not  be  practical  to  assemble  the 
entire  MOPAC  program.  For  example ,  your  computer  may  have  memory 
limitations/  or  you  may  have  very  large  systems  to  be  run,  or  some 
options  may  never  be  wanted.  For  whatever  reason,  if  using  the  entire 
program  is  undesirable,  an  abbreviated  version,  which  lacks  the  full 
range  of  options  of  the  whole  program,  can  be  specified  at  compile  time. 

At  the  bottom  of  the  DIMSIZES.DAT  file  the  programmer  is  asked  for 
various  options  to  be  used  in  compiling.  These  options  allow  arrays  of 
MECI,  PULAY,  and  ESP  to  assume  their  correct  size. 

As  long  as  no  attempt  is  made  to  use  the  reduced  subroutines,  the 
program  will  function  normally.  If  an  attempt  is  made  to  use  an  option 
which  has  been  excluded  then  the  program  will  error. 


157 


INSTALLING  MOPAC 


Page  11-10 


Size  of  MOPAC 

The  amount  of  storage  required  by  MOPAC  depends  mainly  on  the 
number  of  heavy  and  light  atoms.  As  it  is  useful  for  programmers  to 
have  an  idea  of  how  large  various  MOPACs  are,  the  following  data  are 
presented  as  a  guide. 


Sizes  of  various  MOPAC  Version  6.00  executables  in  which  the  number 
of  heavy  atoms  is  equal  to  the  number  of  light  atoms,  assembled  on 
a  VAX  computer,  are: 

No.  of  heavy  atoms  Size  of  Executable  (Kbytes) 


MOPAC  5.00 

MOPAC  6.00 

(AMPAC 

10 

1,653 

2,054 

N/A 

20 

3,442 

4,689 

4,590 

30 

6,356 

8,990 

9,150 

40 

10,400 

14,955 

15,588 

50 

15,572 

22,586 

23,944 

60 

21,872 

31,880 

34,145 

100 

58,361 

87,519 

200 

228,602 

336.867 

300 

511,723 

754,540 

The  size  of  any  given  MOPAC  executable  may  be  estimated  from 

MOPAC  5.00  Size  *  9939  +  N*  9.57  +  N*N*5.64  Kbytes 

MOPAC  6.00  Size  =  1091  +  N*13.40  +  N*N*8.33  Kbytes 

The  large  increase  in  size  of  MOPAC  was  caused  mainly  by  the  inclusion 
of  the  analytical  C.I.  derivatives.  Because  they  are  so  much  more 
efficient  and  accurate  than  finite  differences,  and  because  computer 
memory  is  becoming  more  available,  this  increase  was  accepted  as  the 
lesser  of  two  evils. 

The  size  of  MOPAC  executables  will  vary  from  machine  to  machine, 
due  to  the  different  sizes  of  the  code.  For  a  VAX,  this  amounts 
to  approximately  0.1Mb.  Most  machines  use  a  64  bit  or  8  byte 
double  precision  real  number,  so  the  multipliers  of  N  and  N*N 
should  apply  to  them.  For  large  jobs,  0.1Mb  is  negligable,  therefore 
the  above  expression  should  be  applicable  to  most  computers. 

No.  of  lines  in  program  in 

Version  5.00  =  22,084  =  17,718  code  +  4,366  comment. 

Version  6.00  *  31,857  =  22,526  code  +  9,331  comment. 
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NAMES  OF  FORTRAN-77  FILES 


AABABC 

ANALYT 

ANAVIB 

AXIS 

BLOCK 

BONDS 

CALPAR 

CAPCOR 

CD  I  AG 

CHRGE 

CNVG 

COMPFG 

DCART 

DELMOL 

DELRI 

DENROT 

DENS IT 

DEPVAR 

DERI1 

DERI  2 

DERI 21 

DERI 2 2 

DERI 2 3 

DERITR 

DERNVO 

DERS 

DFOCK2 

DFPSAV 

DIAG 

DIAT 

DI  IS 

DIJKL1 

DIJKL2 

DIPIND 

DIPOLE 

DOFS 

DRC 

DRCOUT 

EF 

ENPART 

EXCHNG 

FFHPOL 

FMAT 

FOCK1 

FOCK2 

FORCE 

FORMXY 

FORSAV 

FREQCY 

GEOUT 

GEOUTG 

GETGEG 

GETGEO 

GETSYM 

GMETRY 

GOVER 

GRID 

HI ELEC 

HADDON 

HCORE 

HQRII 

IJKL 

INTERP 

ITER 

JCARIN 

LINMIN 

LOCMIN 

MAMULT 

MATOUT 

MATPAK 

MECI 

MECID 

MECIP 

MNDO 

MOLDAT 

MOLVAL 

MULLIK 

MULT 

NUCHAR 

PARSAV 

PARTXY 

PATHK 

PATHS 

PERM 

POWSAV 

POWSQ 

PRTDRC 

QUADR 

REACT1 

READ 

REFER 

REPP 

ROTAT 

ROTATE 

RSP 

SEARCH 

SETUPG 

SOLROT 

SWAP 

SYMTRY 

THERMO 

TIMER 

VECPRT 

WRITE 

WRTKEY 

WRTTXT 

XYZINT 

BRLZON 

DATIN 

DERIO 

DERIV 

DIAT2 

DOT 

FLEPO 

FRAME 

GETTXT 

HELECT 

LOCAL 

MECIH 

NLLSQ 

POLAR 

READA 

SECOND 

UPDATE 
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A  list  o£  the  program  segments  which  call  various  subroutines. 


SUBROUTINE  CALLS 


AABABC 

AABACD 

AABBCD 

AINTGS 

ANALYT 

ANAVIB 

DERS 

DELRI 

DELMOL 

AXIS 

RSP 

BABBBC 

BABBCD 

BANGLE 

BFN 

BINTGS 

BKRSAV 

GEOUT 

BONDS 

VECPRT 

MPCBDS 

BRLZON 

CALPAR 

CAPCOR 

CDIAG 

DOFS 

CDIAG 

CHRGE 

ME08A 

EC08C 

SORT 

CNVG 

COE 

CONPPG 

SETUPG 

SYMTRY 

GMETRY 

TIMER 

HCORE 

ITER 

DIBED 

DERIV 

MECIP 

DANG 

DATIN 

UPDATE 

MOLDAT 

CALPAR 

DC  ART 

ANALYT 

DHC 

DIHED 

DELMOL 

DELRI 

ROTAT 

DENROT 
DENS IT 

GMETRY 

COE 

DEPVAR 

DERIO 

DERI1 

TIMER 

DHCORE 

SCOPY 

DFOCK2 

SUPDOT 

MTXM 

DIJKL1 

MECID 

MECIH 

SUPDOT 

TIMER 

MXM 
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DERI  2 

DERI 21 

DERI  2  2 

MXM 

OSINV 

MTXM 

SCOPY 

DERI  2  3 

DIJKL2 

MECID 

MECIH 

SUPDOT 

DERI21 

MTXMC 

HQRII 

MXM 

DERI 2 2 

MXM 

MXMT 

FOCK2 

FOCK1 

SUPDOT 

DERI 2 3 

scopy 

DERITR 

SYMTRy 

GMETRY 

HCORE 

ITER 

DERIV 

DERNVO 

DCART 

JCARIN 

MXM 

GEOUT 

DERITR 

DERNVO 

DERIO 

DERI1 

DERI  2 

DERS 

DFOCK2 

JAB 

KAB 

DPPSAV 

XYZINT 

GEOUT 

DHC 

H1ELEC 

ROTATE 

SOLROT 

FOCK2 

DHCORE 

HI ELEC 

ROTATE 

DIAG 

EPSETA 

DIAGI 

DIAT 

COE 

GOVER 

DIAT  2 

DIAT2 

SET 

DIHED 

DANG 

DIIS 

SPACE 

VECPRT 

MINV 

DIJKL1 

FORMXY 

DIJKL2 

DIPIND 

CHRGE 

GMETRY 

DIPOLE 

|  DOFS 

DRC 

GMETRY 

COMPFG 

PRTDRC 

DRCOUT 

EA08C 

EA09C 

EA09C 

EC08C 

EA08C 

|  EF 

BKRSAV 

COMPFG 

BKRSAV 

UPDHES  HQRII 

FORMD 

SYMTRY 

ENPART 

EPSETA 

EXCHNG 


FFHPOL 

COMPFG 

DIPIND 

VECPRT 

RSP 

MATOUT 

|  FLEPO 

DFPSAV 

COMPFG 

SCOPY 

GEOUT 

SUPDOT 

LINMIN 

DIIS 

FMAT 

FORSAV 

COMPFG 

CHRGE 

FOCK2 

FOCK2D 

JAB 

KAB 

FORCE 

GMETRY 

COMPFG 

NLLSQ 

FLEPO 

WRITE 

XYZINT 

AXIS 

FMAT 

VECPRT 

FRAME 

RSP 

MATOUT 

FREQCY 

MATOUT 

DRC 

ANAVIB 

THERMO 

1  FORMD 
|  FORMXY 
FORSAV 

OVERLP 

FRAME 

AXIS 

FREQCY 

BRLZON 

FRAME 

RSP 

GEOUT 

XYZINT 

WRTTXT 

CHRGE 

GEOUTG 

XXX 

GETDAT 
|  GETGEG 

GETVAL 

GETVAL 

GETVAL 

GETGEO 

GEOUT 

NUCHAR 

XYZINT 

GETSYM 
|  GETTXT 

UPCASE 

GMETRY 

GOVER 

GEOUT 
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I 


I 


I 


GRID 

DPPSAV 

FLEPO 

GEOUT  WRTTXT 

H1ELEC 

DIAT 

HADDON 

DEPVAR 

BCORE 

H1ELEC 

ROTATE 

SOLROT  VECPRT 

HELECT 

HQRII 

IJKL 

PARTXY 

INTERP 

HQRII 

SCHMIT 

SCHMIB  SPLINE 

ITER 

EPSETA 

VECPRT 

FOCK2  FOCK1  WRITE 

INTERP 

PULAY 

HQRII 

DIAG 

MATOUT  SWAP 

DENSIT  CNVG 

JAB 

JCARIN 

SYMTRY 

GMETRY 

KAB 

LINHIN 

COMPFG 

EXCHNG 

LOCAL 

MATOUT 

LOCNIN 

COMPPG 

EXCHNG 

MNDO 

GETDAT 

READ  MOLDAT  DATIN 

REACT 1 

GRID 

PATHS 

PATHK 

FORCE  DRC  NLLSQ 

COMPFG 

POWSQ 

EF 

FLEPO 

WRITE  POLAR 

MAMULT 

MATOUT 

ME08A 

ME08B 

ME08B 

MECI 

IJKL 

PERM  MECIH  VECPRT 

HQRII 

MATOUT 

MECIH 

MECIP 

KINV 

MXM 

MOLDAT 

REFER 

GMETRY  VECPRT 

MOLVAL 

MPCBDS 

MPCPOP 

MPCSYB 

MTXM 

MTXMC 

MXM 

MULLIK 

RSP 

GMETRY 

MULT  DENSIT  VECPRT 

MULT 

MXM 

MXMT 

NLLSQ 

NUCHAR 

PARSAV 

COMPFG 

GEOUT  LOCMIN 

PARSAV 

OSINV 

OVERLP 

PARSAV 

XYZINT 

GEOUT 

PARTXY 

FORMXY 

PATHK 

DFPSAV 

FLEPO 

GEOUT  WRTTXT 

PATHS 

PERM 

DFPSAV 

FLEPO 

WRITE 

POLAR 

GMETRY 

AXIS 

COMPFG  FFHPOL 

POWSAV 

XYZINT 

GEOUT 

POWSQ 

POWSAV 

COMPFG 

VECPRT  RSP 

SEARCH 

PRTDRC 

CHRGE 

XYZINT 

QUADR 

PULAY 

QUADR 

MAMULT 

OSINV 

REACT 1 

GETGEO 

SYMTRY 

GEOUT  GMETRY 

FLEPO 

COMPFG 

WRITE 
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READ 

GETTXT 

GETGEG 

GETGEO 

SYMTRY 

NUCHAR 

WRTTXT 

REFER 

REPP 

ROTAT 

ROTATE 

REPP 

RSP 

SAXPY 

SCHMIB 

EPSETA 

TRED3 

TQLRAT 

SCHMIT 

SCOPY 

SEARCH 

SECOND 

COMPFG 

SET 

AINTGS 

BINTGS 

SETUPG 

SOLROT 

ROTATE 

SORT 

SPACE 

SPLINE 

BFN 

SUPDOT 

SWAP 

SYMTRY 

THERMO 

TIMCLK 

TIMER 

TIMOUT 

TQL2 

TQLRAT 

TRBAK3 

TRED3 

HADDON 

UPCASE 

UPDATE 

UPDHES 

VECPRT 

WRITE 

DATE 

WRTTXT 

GEOUT 

VECPRT 

MATOUT 

CHRGE 

LOCAL 

ENPART 

MULLIK 

WRTKEY 

WRTTXT 

XXX 

XYZGEO 

BANGLE 

DIHED 

XYZINT 

DIHED 

BANGLE 

XYZGEO 

DATE  GEOUT  WRTKEY  GETSYM 
GMETRY 


TQL2  TRBAK3 


DERIV  TIMOUT  SYMTRY  GMETRY  GEOUT 
BRLZON  MPCSYB  DENROT  MOLVAL  BONDS 
MPCPOP  GEOUTG 
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A  list  of  subroutines  called  by  various  segments  (the  inverse  of  the 
first  list) 

Subroutine  Called  by 


AABABC 

MECIH 

AABACD 

MECIH 

AABBCD 

MECIH 

AINTGS 

SET 

ANALYT 

DCART 

ANAVIB 

FORCE 

AXIS 

FORCE 

FRAME 

BABBBC 

MECIH 

BABBCD 

MECIH 

BANGLE 

XYZGEO 

XYZINT 

BFN 

SPLINE 

BINTGS 

SET 

|  BKRSAV 

EF 

BONDS 

WRITE 

BRLZON 

FREQCY 

WRITE 

CALPAR 

DATIN 

CAPCOR 

ITER 

|  CDIAG 

BRLZON 

CHRGE 

DIPIND 

FMAT 

CNVG 

ITER 

COE 

DENROT 

DIAT 

|  COMPFG 

DRC 

EF 

FORCE 

LINMIN 

POLAR 

POWSQ 

DANG 

DIHED 

DATIN 

MNDO 

DCART 

DERITR 

DELMOL 

ANALYT 

DELHI 

ANALYT 

DENROT 

WRITE 

DENS IT 

ITER 

MULLIK 

DEPVAR 

HADDON 

DERIO 

DERNVO 

DERI1 

DERNVO 

DERI  2 

DERI  2 

DERNVO 

DERI 21 

DERI  2 

DERI 2 2 

DERI  2 

DERI 2 3 

DERI  2 

DERITR 

DERITR 

DERNVO 

DERITR 

DERS 

ANALYT 

1  DFOCK2 

DERIl 

1  DFPSAV 

FLEPO 

GRID 

DHC 

DCART 

DERI1 

DHCORE 

DERI1 

DIAG 

DERI21 

ITER 

DIAGI 

DERI21 

DIAT 

DIAT 

HI ELEC 

DIAT2 

DIAT 

DIBED 

COMPFG 

DCART 

|  DIIS 

FLEPO 

DIJKL1 

OERI1 

POLAR 


GEOUT 

PRTDRC 

WRITE 

FFHPOL 

FLEPO 

FMAT 

LOCMIN 

MNDO 

NLLSQ 

REACT 1 

SEARCH 

PATHK  PATHS 


XYZGEO  XYZINT 
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I  DIJKL2 

DERI  2 

DIPIND 

FFHPOL 

DIPOLE 

FMAT 

WRITE 

,  DOFS 

BRLZON 

DRC 

FORCE 

MNDO 

DRCOUT 

PRTDRC 

EA08C 

EC08C 

EA09C 

EA08C 

EC08C 

CDIAG 

EF 

MNDO 

ENPART 

WRITE 

EPSETA 

DIAG 

ITER 

RSP 

EXCHNG 

LINMIN 

LOCMIN 

FFHPOL 

POLAR 

|  FLEPO 

FORCE 

GRID 

MNDO 

REACT1 

FMAT 

FORCE 

|  FOCK2 

DERI 2 2 

DHC 

ITER 

FORCE 

MNDO 

FORMD 

EF 

FORMXY 

DIJKL1 

PARTXY 

FORSAV 

FMAT 

FRAME 

FORCE 

FREQCY 

FREQCY 

FORCE 

GEOUT 

BKRSAV 

DERITR 

DFPSAV 

GMETRY 

GRID 

NLLSQ 

POWSAV 

REACT1 

READ 

WRITE 

WRITE 

|  GEOUTG 

WRITE 

GETDAT 

MNDO 

|  GETGEG 

READ 

GETGEO 

REACT 1 

READ 

GETSYM 

READ 

GETTXT 

READ 

GMETRY 

COMPFG 

DENROT 

DERITR 

FORCE 

JCARIN 

MOLDAT 

REACT 1 

READ 

WRITE 

GOVER 

DIAT 

GRID 

MNDO 

HI ELEC 

DHC 

DHCORE 

HCORE 

HADDON 

SYMTRY 

HCORE 

COMPFG 

DERITR 

HELECT 

DCART 

DERI  2 

ITER 

|  HQRII 

EF 

INTERP 

ITER 

IJKL 

MECI 

INTERP 

ITER 

ITER 

COMPFG 

DERITR 

JAB 

DFOCK2 

FOCK2 

JCARIN 

DERITR 

KAB 

DFOCK2 

FOCK2 

LINMIN 

FLEPO 

LOCAL 

WRITE 

LOCMIN 

NLLSQ 

MNDO 

(main  segment) 

MAMULT 

PULAY 

PATHK 


FLEPO 

PARSAV 


DIPIND 
MULL IK 


MECI 


PATHS 


GETGEO 

PATHK 


DRC 

POLAR 
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MATOUT 

FFHPOL 

FORCE 

ITER 

LOCAL 

MECI 

WRITE 

ME08A 

CDIAG 

ME08B 

ME08A 

MECI 

COMPFG 

DERI1 

DERI  2 

MECI 

MECIH 

DERI1 

DERI  2 

MECI 

MECIP 

COMPFG 

MINV 

DIIS 

MOLDAT 

DATIN 

MNDO 

HOLVAL 

WRITE 

MPCBDS 

BONDS 

MPCPOP 

WRITE 

MPCSYB 

WRITE 

MTXM 

DERI1 

DERI  2 

DERI 21 

MTXMC 

DERI 21 

MULLIK 

WRITE 

MULT 

MULLIK 

MXM 

DERI1 

DERI  2 

DERI 21 

DERI 2 2 

DERITR 

MECIP 

MTXMC 

MXMT 

DERI 2 2 

NLLSQ 

FORCE 

MNDO 

NUCHAR 

GETGEO 

READ 

OSINV 

DERI  2 

PULAY 

OVERLP 

FORMD 

PARSAV 

NLLSQ 

PARTXY 

IJKL 

|  PATHK 

MNDO 

PATHS 

MNDO 

PERM 

MECI 

POLAR 

MNDO 

POWSAV 

POWSQ 

POWSQ 

MNDO 

PRTDRC 

DRC 

PULAY 

ITER 

QUADR 

PRTDRC 

REACT 1 

MNDO 

READ 

MNDO 

REFER 

MOLDAT 

REPP 

ROTATE 

ROTAT 

DELMOL 

DHC 

DHCORE 

HCORE 

SOLROT 

ROTATE 

DHC 

DHCORE 

HCORE 

SOLROT 

RSP 

AXIS 

FFHPOL 

FORCE 

FREQCY 

MULLIK 

POWSQ 

SCHMIB 

INTERP 

SCHMIT 

INTERP 

|  SCOPY 

DERI1 

DERI  2 

DERI  2 3 

FLEPO 

SEARCH 

POWSQ 

|  SECOND 

DERI  2 

DRC 

EF 

ESP 

FLEPO 

FMAT 

FORCE 

GRID 

ITER 

MNDO 

1 

NLLSQ 

PATHK 

PATHS 

POWSQ 

REACT1 

TIMER 

WRITE 

SET 

COMPFG 

DIAT2 

SETUPG 

COMPFG 

SOLROT 

DHC 

HCORE 

|  SORT 

CDIAG 
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SPACE 

DIIS 

SPLINE 

INTERP 

SUPDOT 

DERI1 

DERI1 

DERI  2 

DERI 22 

FLEPO 

SWAP 

ITER 

SYMTRY 

COMPFG 

DERITR 

EF 

JCARIN 

REACT1 

READ 

WRITE 

THERMO 

FORCE 

TIMCLK 

SECOND 

TIMER 

COMPFG 

DERI1 

DERI1 

TIMOUT 

WRITE 

TQL2 

RSP 

TQLRAT 

RSP 

TRBAK3 

RSP 

TRED3 

RSP 

UPCASE 

GETTXT 

UPDATE 

DAT  IN 

UPDHES 

EF 

VECPRT 

BONDS 

DIIS 

FFHPOL 

FORCE 

HCORE 

ITER 

WRITE 

MECI 

MOLDAT 

MULL IK 

POWSQ 

WRITE 

FORCE 

ITER 

MNDO 

PATHS 

REACT1 

WRTKEY 

READ 
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DESCRIPTION  OF  SUBROUTINES  IN  MOPAC 


Utility:  Calculates  the  configuration  interaction  matrix 
element  between  two  configurations  differing  by  exactly 
one  alpha  M.O.  Called  by  MECI  only. 

Utility:  Calculates  the  configuration  interaction  matrix 
element  between  two  configurations  differing  by  exactly 
two  alpha  M.O.'s.  Called  by  MECI  only. 

Utility:  Calculates  the  configuration  interaction  matrix 
element  between  two  configurations  differing  by  exactly 
two  M.O.'s;  one  configuration  has  alpha  M.O.  "A"  and  beta 
M.O.  "C"  while  the  other  configuration  has  alpha  M.O.  "B" 
and  beta  M.O.  "D".  Called  by  MECI  only. 

Utility:  Within  the  overlap  integrals,  calculates  the 
A-integrals.  Dedicated  to  function  SS  within  DIAT. 

Main  Sequence:  Calculates  the  analytical  derivatives 
of  the  energy  with  respect  to  cartesian  coordinates  for  all 
atoms.  Use  only  if  the  mantissa  is  short  (less  than  52  bits) 
or  out  of  interest.  Should  not  be  used  for  routine  work 
on  a  VAX. 

Utility:  Gives  a  brief  interpretation  of  the  modes  of 
vibration  of  the  molecule.  The  principal  pairs  of  atoms 
involved  in  each  vibration  are  identified,  and  the  mode 
of  motion  (tangential  or  radial)  is  output. 

Utility:  Works  out  the  three  principal  moments  of  inertia 
of  a  molecule.  If  the  system  is  linear,  one  moment  of  inertia 
is  zero.  Prints  moments  in  units  of  cm**(-l)  and 
10** (-40)  gram-cm-cm. 

Utility:  Calculates  the  configuration  interaction  matrix 
element  between  two  configurations  differing  by  exactly 
one  beta  M.O.  Called  by  MECI  only. 

Utility:  Calculates  the  configuration  interaction  matrix 
element  between  two  configurations  differing  by  exactly 
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BFN 

BINTGS 

BKRSAV 

BONDS 
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CHRGE 

CNVG 

COE 

COMPFG 

DANG 


two  beta  M.O.'s.  Called  by  MECI  only. 

Utility:  Given  a  set  of  coordinates,  BANGLE  will  calculate  the 
angle  between  any  three  atoms. 

Utility:  Calculates  the  B-functions  in  the  Slater  overlap. 

Utility:  Calculates  the  B-functions  in  the  Slater  overlap. 

Utility:  Saves  and  restores  data  used  by  the  eigenvector 
following  subroutine.  Called  by  EF  only. 

Utility:  Evaluates  and  prints  the  valencies  of  atoms  and 
bond-orders  between  atoms.  Main  argument:  density  matrix. 

No  results  are  passed  to  the  calculation,  and  no  data 
are  changed.  Called  by  WRITE  only. 

Main  Sequence:  BRLZON  generates  a  band  structure,  or  phonon 
structure,  for  high  polymers.  Called  by  WRITE  and  FREQCY. 

Utility:  When  external  parameters  are  read  in  via  EXTERNAL = , 
the  derived  parameters  are  worked  out  using  CALPAR.  Note  that 
all  derived  parameters  are  calculated  for  all  parameterized 
elements  at  the  same  time. 

Utility:  Capping  atoms,  of  type  Cb,  should  not  contribute  to 
the  energy  of  a  system.  CAPCOR  calculates  the  energy 
contribution  due  to  the  Cb  and  subtracts  it  from  the 
electronic  energy. 

Utility:  Complex  diagonalization.  Used  in  generating  eigenvalues 
of  complex  Hermitian  secular  determinant  for  band  structures. 
Called  by  BRLZON  only. 

Utility:  Calculates  the  total  number  of  valence  electrons 
on  each  atom.  Main  arguments:  density  matrix,  array  of 
atom  charges  (empty  on  input).  Called  by  ITER  only. 

Utility:  Used  in  SCF  cycle.  CNVG  does  a  three-point 
interpolation  of  the  last  three  density  matrices. 

Arguments:  Last  three  density  matrices.  Number  of  iterations, 
measure  of  self-consistency  (empty  on  input).  Called  by  ITER 
only. 

Utility:  Within  the  general  overlap  routine  COE  calculates 
the  angular  coefficients  for  the  s,  p  and  d  real  atomic 
orbitals  given  the  axis  and  returns  the  rotation  matrix. 

Main  Sequence:  Evaluates  the  total  heat  of  formation  of  the 
supplied  geometry,  and  the  derivatives,  if  requested.  This 
is  the  nodal  point  connecting  the  electronic  and  geometric 
parts  of  the  program.  Main  arguments:  on  input:  geometry, 
on  output:  heat  of  formation,  gradients. 

Utility:  Called  by  XYZINT,  DANG  computes  the  angle  between  a 
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point,  the  origin,  and  a  second  point. 

Utility:  Reads  in  external  parameters  for  use  within 
MOPAC.  Originally  used  for  the  testing  of  new  parameters, 

DATIN  is  now  a  general  purpose  reader  for  parameters. 

Invoked  by  the  keyword  EXTERNAL. 

Utility:  Called  by  DERIV,  DCART  sets  up  a  list  of  cartesian 
derivatives  of  the  energy  W.R.T.  coordinates  which  DERIV  can 
then  use  to  construct  the  internal  coordinate  derivatives. 

Utility:  Part  of  analytical  derivates.  Two-electron. 

Utility:  Part  of  analytical  derivates.  Two-electron. 

Utility:  Converts  the  ordinary  density  matrix  into 
a  condensed  density  matrix  over  basis  functions  s  (sigma), 
p  (sigma)  and  p  (pi),  i.e.,  three  basis  functions.  Useful 
in  hybridization  studies.  Has  capability  to  handling  "d" 
functions,  if  present. 

Utility:  Constructs  the  Coulson  electron  density  matrix  from 
the  eigenvectors.  Main  arguments:  Eigenvectors,  No.  of  singly 
and  doubly  occupied  levels,  density  matrix  (empty  on  input) 
Called  by  ITER. 

Utility:  A  symmetry-defined  "bond  length"  is  related  to 
another  bond  length  by  a  multiple.  This  special  symmetry 
function  is  intended  for  use  in  Cluster  calculations. 

Called  by  HADDON. 

Utility:  Part  of  the  analytical  C.I.  derivative  package. 
Calculates  the  diagonal  dominant  part  of  the  super-matrix. 

Utility:  Part  of  the  analytical  C.I.  derivative  package. 
Calculates  the  frozen  density  contribution  to  the  derivative  of 
the  energy  w.r.t.  cartesian  coordinates,  and  the  derivatives  of 
the  frozen  Fock  matrix  n  M.O.  basis.  It's  partner  is  DERI 2 . 

Utility:  Part  of  the  analytical  C.I.  derivative  package. 
Calculates  the  relaxing  density  contribution  to  the  derivative 
of  the  energy  w.r.t.  cartesian  coordinates.  Uses  the  results  of 
DERI1 . 

Utility:  Part  of  the  analytical  C.I.  derivative  package. 

Called  by  DERI 2  only. 

Utility:  Part  of  the  analytical  C.I.  derivative  package. 

Called  by  DERI 2  only. 

Utility:  Part  of  the  analytical  C.I.  derivative  package. 

Called  by  DERI 2  only. 

Utility:  Calculates  derivatives  of  the  energy  w.r.t.  internal 
coordinates  using  full  SCF's.  Used  as  a  foolproof  way  of 
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calculating  derivatives.  Not  recommended  for  normal  use. 

Main  Sequence:  Calculates  the  derivatives  of  the  energy  with 
respect  to  the  geometric  variables.  This  is  done  either  by 
using  initially  cartesian  derivatives  (normal  mode),  by 
analytical  C.I.  RHF  derivatives,  or  by  full  SCF  calculations 
(NOANCI  in  half-electron  and  C.I.  mode).  Arguments:  on 
input:  geometry,  on  output:  derivatives.  Called  by  COMPFG. 

Analytical  C.I.  Derivative  main  subroutine.  Calculates  the 
derivative  of  the  energy  w.r.t.  geometry  for  a  non-variationally 
optimized  wavefunction  (a  SCF-CI  wavefunction) . 

Utility:  Called  by  ANALYT,  DERS  calculates  the  analytical 
derivatives  of  the  overlap  matrix  within  the  molecular  frame. 

Utility:  A  function  called  by  ESP. 

Utility:  Part  of  the  analytical  C.I.  derivative  package.  Called 
by  DERI1,  DFOCK2  calculates  the  frozen  density  contribution  to 
the  derivative  of  the  energy  w.r.t.  cartesian  coordinates. 

Utility:  Saves  and  restores  data  used  by  the 
BFGS  geometry  optimization.  Main  arguments: 

parameters  being  optimized,  gradients  of  parameters,  last  heat 
of  formation,  integer  and  real  control  data.  Called  by  FLEPO. 

Utility:  Called  by  DCART  and  calculates  the  energy  of  a  pair 
of  atoms  using  the  SCF  density  matrix.  Used  in  the  finite 
difference  derivatve  calculation. 

Utility:  Part  of  the  analytical  C.I.  derivative  package.  Called 
by  DERI1 ,  DHCORE  calculates  the  derivatives  of  the  1  and  2 
electron  integrals  w.r.t.  cartesian  coordinates. 

Utility:  Rapid  pseudo-diagonalization.  Given  a  set  of  vectors 
which  almost  block-diagonalize  a  secular  determinant,  DIAG 
modifies  the  vectors  so  that  the  block-diagonalization  is  more 
exact.  Main  arguments:  Old  vectors.  Secular  Determinant, 

New  vectors  (on  output).  Called  by  ITER. 

Utility:  Calculates  the  electronic  energy  arising  from 
a  given  configuration.  Called  by  MECI. 

Utility:  Calculates  overlap  integrals  between  two  atoms  in 
general  cartesian  space.  Principal  quantum  numbers  up  to  6,  and 
angular  quantum  numbers  up  to  2  are  allowed.  Main  arguments: 
Atomic  numbers  and  cartesian  coordinates  in  Angstroms  of  the 
two  atoms.  Diatomic  overlaps  (on  exit).  Called  by  HI ELEC. 

Utility:  Calculates  reduced  overlap  integrals  between  atoms 
of  principal  quantum  numbers  1,  2,  and  3,  for  s  and  p  orbitals. 
Faster  than  the  SS  in  DIAT.  This  is  a  dedicated  subroutine,  and 
is  unable  to  stand  alone  without  considerable  backup.  Called 
by  DIAT. 
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Utility:  Part  of  READA.  DIGIT  assembles  numbers  given  a 
character  string. 

Utility:  Calculates  the  dihedral  angle  between  four  atoms. 

Used  in  converting  from  cartesian  to  internal  coordinates. 

Utility:  Pulay's  Geometric  Direct  Inversion  of  the  Iterative 
Subspace  (G-DIIS)  accelerates  the  rate  at  which  the  BFGS 
locates  an  energy  minimum.  (In  MOPAC  6.00,  the  DIIS  is  only 
partially  installed  -  several  capabilities  of  the  DIIS  are  not 
used) 

Utility:  Part  of  the  analytical  C.I.  derivative  package.  Called 
by  DERI1,  DIJKL1  calculates  the  two-electron  integrals  over 
M.O.  bases,  e.g.  <i,j  (1/r)  k,l>. 

Utility:  Part  of  the  analytical  C.I.  derivative  package.  Called 
by  DERI 2 ,  DIJKL2  calculates  the  derivatives  of  the  two-electron 
integrals  over  M.O.  bases,  e.g.  <i,j  (1/r)  k,l>,  w.r.t. 
cartesian  coordinates. 

Utility:  Similar  to  DIPOLE,  but  used  by  the  POLAR  calculation 
only. 

Utility:  Evaluates  and,  if  requested,  prints  dipole  components 
and  dipole  for  the  molecule  or  ion.  Arguments:  Density  matrix. 
Charges  on  every  atom,  coordinates,  dipoles  (on  exit). 

Called  by  WRITE  and  FMAT. 

Utility:  Called  by  ESP  only,  DIST2  works  out  the  distance 
between  two  points  in  3D  space. 

Main  Sequence:  Calculates  the  density  of  states  within  a 
Brillouin  zone.  Used  in  polymer  work  only. 

Utility:  Given  two  vectors,  X  and  Y,  of  length  N,  function  DOT 
returns  with  the  dot  product  X.Y.  I.e.,  if  X=Y,  then  DOT  =  the 
square  of  X.  Called  by  TLEPO. 

Main  Sequence:  The  dynamic  and  intrinsic  reaction  coordinates 
are  calculated  by  following  the  mass-weighted  trajectories. 

Utility:  Sets  up  DRC  and  IRC  data  in  quadratic  form 
preparatory  to  being  printed. 

Part  of  the  diagonalizer  RSP. 

Part  of  the  diagonalizer  RSP. 

Part  of  the  diagonalizer  RSP. 

Main  Sequence:  EF  is  the  Eigenvector  Following  routine. 

EF  implements  the  keywords  EF  and  TS. 

Utility:  Within  the  ESP,  ELESP  calculates  the  electronic 
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contribution  to  the  electrostatic  potential. 

Utility:  Partitions  the  energy  of  a  molecule  into  its  monatomic 
and  diatomic  components.  Called  by  WRITE  when  the  keyword 
ENPART  is  specified.  No  data  are  changed  by  this  call. 

Utility:  Calculates  the  machine  precision  and  dynamic  range 
for  use  by  the  two  diagonalizers. 

Main  Sequence:  ESP  is  not  present  in  the  default  copy  of  MOPAC. 

ESP  calculates  the  atomic  charges  which  would  reproduce  the 
electrostatic  potential  of  the  nuclii  and  electronic  wavefunction. 

Block  Data:  Used  by  the  ESP  calculation,  ESPBLO  fills  two  small 
arrays! 

Utility:  Part  of  the  ESP.  ESP  fits  the  quantum  mechanical 
potential  to  a  classical  point  charge  model 

Utility:  Dedicated  procedure  for  storing  3  parameters  and  one 
array  in  a  store.  Used  by  SEARCH. 

Utility:  Part  of  the  POLAR  calculation.  Evaluates  the 
effect  of  an  electric  field  on  a  molecule. 

Main  Sequence:  Optimizes  a  geometry  by  minimizing  the  energy. 

Makes  use  of  the  first  and  estimated  second  derivatives  to 
achieve  this  end.  Arguments:  Parameters  to  be  optimized, 
(overwritten  on  exit  with  the  optimized  parameters).  Number  of 
parameters,  final  optimized  heat  of  formation.  Called  by  MAIN, 
REACT1,  and  FORCE. 

Utility:  Part  of  CDIAG. 

Utility:  part  of  CDIAG. 

Main  sequence:  Calculates  the  exact  Hessian  matrix  for  a  system 
This  is  done  by  either  using  differences  of  first  derivatives 
(normal  mode)  or  by  four  full  SCF  calculations  (half  electron 
or  C.I.  mode).  Called  by  FORCE. 

Utility:  Adds  on  to  Fock  matrix  the  one-center  two  electron 
terms.  Called  by  ITER  only. 

Utility:  Adds  on  to  Fock  matrix  the  two-center  two  electron 
terms.  Called  by  ITER  and  DERIV.  In  ITER  the  entire  Fock  matrix 
is  filled;  in  DERIV,  only  diatomic  Fock  matrices  are 
constructed. 

Written  out  of  MOPAC  6.00. 

Main  sequence:  Performs  a  force-constant  and  vibrational 
frequency  calculation  on  a  given  system.  If  the  starting 
gradients  are  large,  the  geometry  is  optimized  to  reduce  the 
gradient  norm,  unless  LET  is  specified  in  the  keywords. 
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Isotopic  substitution  is  allowed.  Thermochemical  quantities 
are  calculated.  Called  by  MAIN. 

FORMD  Main  Sequence:  Called  by  EF.  FORMD  constructs  the  next  step 
in  the  geometry  optimization  or  transition  state  location. 

FORMXY  Utility:  Part  of  DIJKL1.  FORMXY  constructs  part  of  the  two- 
electron  integral  over  M.O.'s. 

FORSAV  Utility:  Saves  and  restores  data  used  in  FMAT  in  FORCE 
calculation.  Called  by  FMAT. 

FRAME  Utility:  Applies  a  very  rigid  constraint  on  the  translations 
and  rotations  of  the  system.  Used  to  separate  the  trivial 
vibrations  in  a  FORCE  calculation. 

FREQCY  Main  sequence:  Final  stage  of  a  FORCE  calculation.  Evaluates 
and  prints  the  vibrational  frequencies  and  modes. 

FSUB  Utility:  Part  of  ESP. 

GENUN  Utility:  Part  of  ESP.  Generates  unit  vectors  over  a  sphere, 

called  by  SURFAC  only. 

GEOUT  Utility:  Prints  out  the  current  geometry.  Can  be  called  at 
any  time.  Does  not  change  any  data. 

GEOUTG  Utility:  Prints  out  the  current  geometry  in  Gaussian  Z-matrix 
format. 

GETDAT  Utility:  Reads  in  all  the  datar  and  puts  it  in  a  scratch  file 
on  channel  5. 


GETGEG  Utility:  Reads  in  Gaussian  Z-matrix  geometry.  Equivalent  to 
GETGEO  and  GETSYM  combined. 


GETGEO 

GETSYM 

GETTXT 

GETVAL 

GMETRY 


GOVER 


Utility:  Reads  in  geometry  in  character  mode  from  specified 
channel/  and  stores  parameters  in  arrays.  Some  error-checking 
is  done.  Called  by  READ  and  REACT1 . 


Utility:  Reads  in  symmetry  data.  Used  by  READ. 
Utility:  Reads  in  KEYWRD,  KOMENT  and  TITLE. 


Utility:  Called  by  GETGEG/  GETVAL  either  gets  an  internal 
coordinate  or  a  logical  name  for  that  coordinate. 


Utility:  Fills  the  cartesian  coordinates  array.  Data  are 
supplied  from  the  array  GEO,  GEO  can  be  (a)  in  internal 
coordinates,  or  (b)  in  cartesian  coordinates.  If  STEP  is 
non-zero,  then  the  coordinates  are  modified  in  light  of  the 
other  geometry  and  STEP.  Called  by  HCORE,  DERIV,  READ,  WRITE, 
MOLDAT,  etc. 


Utility:  Calculates  the  overlap  of  two  Slater  orbitals  which 
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have  been  expanded  into  six  gaussians.  Calculates  the 
STP-6G  overlap  integrals. 

Main  Sequence:  Calculates  a  grid  of  points  for  a  2-D  search 
in  coordinate  space.  Useful  when  more  information  is  needed 
about  a  reaction  surface. 

Utility:  Given  any  two  atoms  in  cartesian  space,  HI ELEC 
calculates  the  one-electron  energies  of  the  off-diagonal 
elements  of  the  atomic  orbital  matrix. 

H(i,  j)  =  -S(i, j)*(beta(i)+beta( j) )/2. 

Called  by  HCORE  and  DERIV. 

Utility:  The  symmetry  operation  subroutine,  HADDON  relates  two 
geometric  variables  by  making  one  a  dependent  function  of  the 
other.  Called  by  SYMTRY  only. 

Main  sequence:  Sets  up  the  energy  terms  used  in  calculating  the 
SCF  heat  of  formation.  Calculates  the  one  and  two  electron 
matrices,  and  the  nuclear  energy. 

Called  by  COMPFG. 

Utility:  Given  the  density  matrix,  and  the  one  electron  and 
Fock  matrices,  calculates  the  electronic  energy.  No  data  are 
changed  by  a  call  of  HELECT.  Called  by  ITER  and  DERIV. 

Utility:  Rapid  diagonalization  routine.  Accepts  a  secular 
determinant,  and  produces  a  set  of  eigenvectors  and 
eigenvalues.  The  secular  determinant  is  destroyed. 

Utility:  Fills  the  large  two-electron  array  over  a  M.O. 
basis  set.  Called  by  MECI. 

Utility:  Runs  the  Camp-King  converger,  q.v. 

Main  sequence:  Given  the  one  and  two  electron  matrices,  ITER 
calculates  the  Fock  and  density  matrices,  and  the  electronic 
energy.  Called  by  COMPFG. 

Utility:  Calculates  the  coulomb  contribution  to  the  Fock  matrix 
in  NDDO  formalism.  Called  by  FOCK 2 . 

Utility:  Calculates  the  difference  vector  in  cartesian  coordinates 
corresponding  to  a  small  change  in  internal  coordinates. 

Utility:  Calculates  the  exchange  contribution  to  the  Fock  matrix 
in  NDDO  formalism.  Called  by  FOCK 2 . 

Main  sequence:  Called  by  the  BFGS  geometry  optimized  FLEPO, 

LINMIN  takes  a  step  in  the  search-direction  and  if  the  energy 
drops,  returns.  Otherwise  it  takes  more  steps  until  if  finds 
one  which  causes  the  energy  to  drop. 

Utility:  Given  a  set  of  occupied  eigenvectors,  produces  a 
canonical  set  of  localized  bonding  orbitals,  by  a  series  of 
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2x2  rotations  which  maximize  <psi**4>.  Called 
by  WRITE. 

Main  sequence:  In  a  gradient  minimization,  LOCMIN  does  a  line- 
search  to  find  the  gradient  norm  minimum.  Main  arguments: 
current  geometry,  search  direction,  step,  current  gradient 
norm;  on  exit:  optimized  geometry,  gradient  norm. 

Utility:  Matrix  multiplication.  Two  matrices,  stored  as  lower 
half  triangular  packed  arrays,  are  multiplied  together,  and  the 
result  stored  in  a  third  array  as  the  lower  half  triangular 
array.  Called  from  PULAY. 

Utility:  Matrix  printer.  Prints  a  square  matrix,  and  a 
row-vector,  usually  eigenvectors  and  eigenvalues.  The  indices 
printed  depend  on  the  size  of  the  matrix:  they  can  be  either 
over  orbitals,  atoms,  or  simply  numbers,  thus  M.O.'s  are  over 
orbitals,  vibrational  modes  are  over  numbers.  Called  by  WRITE, 
FORCE. 

Utilities:  Part  of  the  complex  diagonalizer ,  and  called  by 
CDIAG . 

Main  sequence:  Main  function  for  Configuration  Interaction, 

MECI  constructs  the  appropriate  C.I.  matrix,  and  evaluates  the 
roots,  which  correspond  to  the  electronic  energy  of  the  states 
of  the  system.  The  appropriate  root  is  then  returned. 

Called  by  ITER  only. 

Utility:  Constructs  the  differential  C.I.  secular  determinant. 

Utility:  Constructs  the  normal  C.I.  secular  determinant. 

Utility:  Reforms  the  density  matrix  after  a  MECI  calculation. 

Utility:  Called  by  DIIS.  MINV  inverts  the  Hessian  matrix. 

Main  sequence:  MAIN  program.  MNDO  first  reads  in  data  using 
READ,  then  calls  either  FLEPO  to  do  geometry  optimization, 

FORCE  to  do  a  FORCE  calculation,  PATHS  for  a  reaction  with  a 
supplied  coordinate,  NLLSQ  for  a  gradient  minimization  or 
REACT1  for  locating  the  transition  state.  Starts  the  timer. 

Main  Sequence:  Sets  up  all  the  invariant  parameters  used  during 
the  calculation,  e.g.  number  of  electrons,  initial  atomic 
orbital  populations,  number  of  open  shells,  etc.  Called  once  by 
MNDO  only. 

Utility:  Calculates  the  contribution  from  each  M.O.  to  the 
total  valency  in  the  molecule.  Empty  M.O.'s  normally 
have  a  negative  molecular  valency. 

Utility:  Part  of  the  matrix  package.  Multiplies  together  two 
rectangular  packed  arrays,  i.e.,  C  =  A.B. 
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|  MTXMC 
MULLIK 

MULT 

MXM 

|  MXMT 
MYWORD 


NAICAP 

NAICAP 

NLLSQ 

NUCHAR 

OSINV 

OVERLP 

OVLP 

PARSAV 

PARTXY 

PATHK 

PATHS 


|  PDGRID 
PERM 


Utility:  Part  of  the  matrix  package.  Similar  to  MTXM. 

Utility:  Constructs  and  prints  the  Mulliken  Population 
Analysis.  Available  only  for  RHF  calculations.  Called  by 
WRITE. 

Utility:  Used  by  MULLIK  only,  MULT  multiplies  two  square 
matrices  together. 

Utility:  Part  of  the  matrix  package.  Similar  to  MTXM. 

Utility:  Part  of  the  matrix  package.  Similar  to  MTXM. 

Utility:  Called  in  WRTKEY,  MYWORD  checks  for  the  existance  of 
a  specific  string.  If  it  is  found,  MYWORD  is  set  true,  and 
the  all  occurances  of  string  are  deleted.  Any  words 
not  recognised  will  be  flagged  and  the  job  stopped. 

Utility:  Called  by  ESP. 

Utility:  Called  by  ESP. 

Main  sequence:  Used  in  the  gradient  norm  minimization. 

Takes  a  character  string  and  reads  all  the  numbers  in  it 
and  stores  these  in  an  array. 

Utility:  Inverts  a  square  matrix.  Called  by  PULAY  only. 

Utility:  Part  of  EF.  OVERLP  decides  which  normal  mode  to 
follow. 

Utility:  Called  by  ESP  only.  OVLP  calculates  the  overlap 
over  Gaussian  STO's. 

Utility:  Stores  and  restores  data  used  in  the  gradient-norm 
minimization  calculation. 

Utility:  Called  by  IJKL  only,  PARTXY  calculates  the  partial 
product  <i,j  (1/r)  in  <i,j  <l/r)  k,l>. 

Main  sequence:  Calculates  a  reaction  coordinate  which  uses 
a  constant  step-size.  Invoked  by  keywords  STEP  and  POINTS. 

Main  sequence:  Given  a  reaction  coordinate  as  a  row-vector, 
PATHS  performs  a  FLEPO  geometry  optimization  for  each  point, 
the  later  geometries  being  initially  guessed  from  a  knowledge 
of  the  already  optimized  geometries,  and  the  current  step. 
Called  by  MNDO  only. 

Utility:  Part  of  ESP.  Calculates  the  Williams  surface. 

Utility:  Permutes  nl  electrons  of  alpha  or  beta  spin  among 
n2  M.O. 's. 
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POLAR 

POWSAV 

POWSQ 

PRTDRC 

PULAY 


QUADR: 

REACT 1 

READ 

READA 

REFER 

REPP 

ROTAT 


Utility:  Calculates  the  polarizability  volumes  for  a  molecule 
or  ion.  Uses  19  SCF  calculations,  so  appears  after  WRITE  has 
finished.  Cannot  be  used  with  FORCE,  but  can  be  used  anywhere 
else.  Called  by  WRITE. 

Utility:  Calculation  store  and  restart  for  SIGMA 
calculation.  Called  by  POWSQ. 

Main  sequence:  The  Mclver  -  Komornicki  gradient 
minimization  routine.  Constructs  a  full  Hessian  matrix 
and  proceeds  by  line-searches  Called  from  MAIN  when 
SIGMA  is  specified. 

Utility:  Prints  DRC  and  IRC  results  according  to  instructions. 
Output  can  be  (a)  every  point  calculated  (default),  (b)  in 
constant  steps  in  time,  space  or  energy. 

Utility:  A  new  converger.  Uses  a  powerful 

mathematical  non-iterative  method  for  obtaining  the  SCF  Fock 
matrix.  Principle  is  that  at  SCF  the  eigenvectors  of  the  Fock 
and  density  matrices  are  identical,  so  [F.P]  is  a  measure  of 
the  non-self  consistency.  While  very  powerful,  PUL*Y  is  not 
universally  applicable.  Used  by  ITER. 

Utility:  Used  in  printing  the  IRC  -  DRC  results.  Sets  up 
a  quadratic  in  time  of  calculated  quantities  so  that  PRTDRC 
can  select  specific  reaction  times  for  printing. 

Main  sequence:  Uses  reactants  and  products  to  find  the 
transition  state.  A  hypersphere  of  N  dimensions  is  centered  on 
each  moiety,  and  the  radius  steadily  reduced.  The  entity  of 
lower  energy  is  moved,  and  when  the  radius  vanishes,  the 
transition  state  is  reached.  Called  by  MNDO  only. 

Main  sequence:  Almost  all  the  data  are  read  in  through  READ. 
There  is  a  lot  of  data-checking  in  READ,  but  very  little 
calculation.  Called  by  MNDO. 

Utility:  General  purpose  character  number  reader.  Used  to  enter 
numerical  data  in  the  control  line  as  "  <variable>=n.nnn  "  where 
<variable>  is  a  mnemonic  such  as  SCFCRT  or  CHARGE. 

Called  by  READ,  FLEPO,  ITER,  FORCE,  and  many  other  subroutines. 

Utility:  Prints  the  original  references  for  atomic  data. 

If  an  atom  does  not  have  a  reference,  i.e.,  it  has  not  been 
parameterized,  then  a  warning  message  will  be  printed  and 
the  calculation  stopped. 

Utility:  Calculates  the  22  two-electron  reduced  repulsion 
integrals,  and  the  8  electron-nuclear  attraction  integrals. 

These  are  in  a  local  coordinate  system.  Arguments:  atomic 
numbers  of  the  two  atoms,  interatomic  distance,  and  arrays  to 
hold  the  calculated  integrals.  Called  by  ROTATE  only. 

Utility:  Rotates  analytical  two-electron  derivatives  from 
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ROTATE 

RSP 

SAXPY 

SCHMIB 

SCHMIT 

SCOPY 

SDOT 

SEARCH 

SECOND 

SET 

SETUP 3 

SETUPG 

SOLROT 

SORT 

SPACE 

SPCG 

SPLINE 

SS 


atomic  to  molecular  frame. 

Utility:  All  the  two-electron  repulsion  integrals,  the  electron- 
nuclear  attraction  integrals,  and  the  nuclear-nuclear  repulsion 
term  between  two  atoms  are  calculated  here.  Typically  100  two- 
electron  integrals  are  evaluated. 

Utility:  Rapid  diagonalization  routine.  Accepts  a  secular 
determinant,  and  produces  a  set  of  eigenvectors  and 
eigenvalues.  The  secular  determinant  is  destroyed. 

Utility:  Called  by  the  utility  SUPDOT  only! 

Utility:  Part  of  Camp-King  converger. 

Utility:  Part  of  Camp-King  converger. 

Utility:  Copies  an  array  into  another  array. 

Utility:  Forms  the  scalar  of  the  product  of  two  vectors. 

Utility:  Part  of  the  SIGMA  and  NLLSQ  gradient  minimizations. 

The  line-search  subroutine,  SEARCH  locates  the  gradient 
minimum  and  calculates  the  second  derivative  of  the  energy 
in  the  search  direction.  Called  by  POWSQ  and  NLLSQ. 

Utility:  Contains  VAX  specific  code.  Function  SECOND 
returns  the  number  of  CPU  seconds  elapsed  since  an  arbitrary 
starting  time.  If  the  SHUTDOWN  command  has  been  issued, 
the  CPU  time  is  in  error  by  exactly  1,000,000  seconds,  and 
the  job  usually  terminates  with  the  message  "time  exceeded". 

Utility:  Called  by  DIAT2 ,  evaluates  some  terms  used  in  overlap 
calculation. 

Utility:  Sets  up  the  Gaussian  expansion  of  Slater  orbitals 
using  a  STO-3G  basis  set. 

Utility:  Sets  up  the  Gaussian  expansion  of  Slater  orbitals 
using  a  STO-6G  basis  set. 

Utility:  For  Cluster  systems,  adds  all  the  two-electron 
integrals  of  the  same  type,  between  different  unit  cells,  and 
stores  them  in  a  single  array.  Has  no  effect  on  molecules. 

Utility:  Part  of  CDIAG,  the  complex  diagonalizer . 

Utility:  Called  by  DIIS  only. 

Written  out  of  Version  6.00. 

Utility:  Part  of  Camp-King  converger. 

Utility:  An  almost  general  Slater  orbital  overlap  calculation. 
Called  by  DIAT. 
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SUP DOT 
SURFAC 
SWAP 

SYMTRY 

THERMO 


TIMCLK 

TIMER 

TIMOUT 

TQL2 

TQLRAT 

TRBAK3 

TRED3 

UPDATE 

UPDHES 

VECPRT 


WRITE 


WRTKEY 

WRTTXT 

XXX 


Utility:  Matrix  mutiplication  A=B.C 
Utility:  Part  of  the  ESP. 

Utility:  Used  with  FILL-,  SWAP  ensures  that  a  specified 
M.O.  is  filled.  Called  by  ITER  only. 

Utility:  Calculates  values  for  geometric  parameters  from  known 
geometric  parameters  and  symmetry  data.  Called  whenever  6METRY 
is  called. 

Main  sequence:  After  the  vibrational  frequencies  have  been 
calculated,  THERMO  calculates  thermodynamic  quantities  such  as 
internal  energy,  heat  capacity,  entropy,  etc,  for  translational, 
vibrational,  and  rotational,  degrees  of  freedom. 

Utility:  Vax-specific  code  for  determining  CPU  time. 

Utility:  Prints  times  of  various  steps. 

Utility:  Prints  total  CPU  time  in  elegant  format. 

Utility:  Part  of  the  RSP. 

Utility:  Part  of  the  RSP. 

Utility:  Part  of  the  RSP. 

Utility:  Part  of  the  RSP. 

Utility:  Given  a  set  of  new  parameters,  stores  these 

in  their  appropriate  arrays.  Invoked  by  EXTERNAL. 

Utility:  Called  by  EF,  UPDHES  updates  the  Hessian  matrix. 

Utility:  Prints  out  a  packed,  lower-half  triangular  matrix. 

The  labeling  of  the  sides  of  the  matrix  depend  on  the  matrix's 
size:  if  it  is  equal  to  the  number  of  orbitals,  atoms,  or  other. 
Arguments:  The  matrix  to  be  printed,  size  of  matrix.  No  data 
are  changed  by  a  call  of  VECPRT. 

Main  sequence:  Most  of  the  results  are  printed  here.  All 
relevant  arrays  are  assumed  to  be  filled.  A  call  of  WRITE  only 
changes  the  number  of  SCF  calls  made,  this  is  reset  to  zero. 

No  other  data  are  changed.  Called  by  MAIN,  FLEPO,  FORCE. 

Main  Sequence:  Prints  all  keywords  and  checks  for 
compatability  and  to  see  if  any  are  not  recognised. 

WRTKEY  can  stop  the  job  if  any  errors  are  found. 

Main  Sequence:  Writes  out  KEYWRD,  KOMENT  and  TITLE.  The 
inverse  of  GETTXT . 

Utility:  Forms  a  unique  logical  name  for  a  Gaussian  Z-matrix 
logical.  Called  by  GEOUTG  only. 


i 
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XYZINT 

XYZGEO 


Utility:  Converts  from  cartesian  coordinates  into  internal. 

XYZINT  sets  up  its  own  numbering  system,  so  no  connectivity 
is  needed. 
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HEATS  OF  FORMATION  OF  SOME  MNDO,  PM3  AND  AMI  COMPOUNDS 


In  order  to  verify  that  MOPAC  is  working  correctly,  a  large  number 
of  tests  need  to  be  done.  These  take  about  45  minutes  on  a  VAX  11-780, 
and  even  then  many  potential  bugs  remain  undetected.  It  is  obviously 
impractical  to  ask  users  to  test  MOPAC.  However,  users  must  be  able  to 
verify  the  basic  working  of  MOPAC,  and  to  do  this  the  following  tests 
for  the  elements  have  been  provided. 

Each  element  can  be  tested  by  making  up  a  data-file  using  estimated 
geometries  and  running  that  file  using  MOPAC.  The  optimized  geometries 
should  give  rise  to  heats  of  formation  as  shown.  Any  difference  greater 
than  0.1  Kcal/mole  indicates  a  serious  error  in  the  program. 

Caveats 


1.  Geometry  definitions  must  be  correct. 

2.  Heats  of  formation  may  be  too  high  for  certain 
compounds.  This  is  due  to  a  poor  starting  geometry 
trapping  the  system  in  an  excited  state.  (Affects 
IC1  at  times) 
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Element 

Test  Compound 

Heat 

MINDO/3 

of  Formation 
MNDO  AMI 

PM3 

Hydrogen 

CH4 

-6.3 

-11.9 

-8.8 

-13.0 

Lithium 

Beryllium 

LiH 

BeO 

+23.2 
+  38.6 

+53.0 

Boron 

Carbon 

BF3 

CH4 

-270.2 

-6.3 

-261.0 

-11.9 

-272.1* 

-8.8 

-13.0 

Nitrogen 

NH3 

-9.1 

-6.4 

-7.3 

-3.1 

Oxygen 

C02 

-95.7 

-75.1 

-79.8 

-85.0 

Fluorine 

CF4 

-223.9 

-214.2 

-225.7 

-225.1 

Magnesium 

MgF2 

-160.7 

Aluminium 

A1F 

-83.6 

-77.9 

-50.1 

Silicon 

SiH 

+  82.9 

+90.2 

+84.5 

+94.6 

Phosphorus 

PH3 

+  2.5 

+3.9 

+10.2 

+0.2 

Sulfur 

H2S 

-2.6 

+3.8 

+1.2 

-0.9 

Chlorine 

HC1 

-21.1 

-15.3 

-24.6 

-20.5 

Zinc 

ZnMe2 

+19.9 

+19.8 

8.2 

Gallium 

GaC13 

-79.7 

Germanium 

GeF 

-16.4 

-19.7 

-3.3 

Arsenic 

AsH3 

+12.7 

Selenium 

SeC12 

-38.0 

Bromine 

HBr 

+3.6 

-10.5 

+5.3 

Cadmium 

CdC12 

-48.6 

Indium 

InC13 

-72.8 

Tin 

SnF 

-20.4 

-17.5 

Antimony 

SbC13 

-72.4 

Tellurium 

TeH2 

+  23.8 

Iodine 

IC1 

-6.7 

-4.6 

+  10.8 
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Mercury 

HgC12 

-36.9  -44.8 

-32.7 

Thallium 

T1C1 

*-13.4 

Lead 

PbF 

-22.6 

-21.0 

Bismuth 

B1C13 

-42.6 

*  Not  an  exhaustive 

test  of  AMI  boron. 
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C 

N 

O 
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